ON THE DIAMETER OF PERMUTATION GROUPS 
Abstract. Given a finite group G and a set A of generators, the diameter 
diam(Γ(G, A)) of the Cayley graph Γ(G, A) is the smallest ℓ such that every 
element of G can be expressed as a word of length at most ℓ in A ∪ A^{-1}. We are 
concerned with bounding diam(G) := max_A diam(Γ(G, A)). 

It has long been conjectured that the diameter of the symmetric group of degree 
n is polynomially bounded in n, but the best previously known upper bound was 
exponential in √(n log n). We give a quasipolynomial upper bound, namely, 

diam(G) = exp(O((log n)^4 log log n)) = exp((log log |G|)^{O(1)}) 

for G = Sym(n) or G = Alt(n), where the implied constants are absolute. This 
addresses a key open case of Babai's conjecture on diameters of simple groups. 
By standard results, our bound also implies a quasipolynomial upper bound on 
the diameter of all transitive permutation groups of degree n. 



1. Introduction 

1.1. Groups and their diameters. Let A be a set of generators for a group G. 
The (undirected) Cayley graph Γ(G, A) is the graph whose set of vertices is V = G 
and whose set of edges is E = {{g, ga} : g ∈ G, a ∈ A}. The diameter diam(Γ) of a 
graph Γ(V, E) is defined by 

(1.1) diam(Γ) = max_{v_1, v_2 ∈ V} min_{P a path from v_1 to v_2} length(P). 

In particular, the diameter of a Cayley graph Γ(G, A) is the maximum, for g ∈ G, of 
the length ℓ of the shortest expression g = a_1^{ε_1} a_2^{ε_2} ⋯ a_ℓ^{ε_ℓ} with a_i ∈ A 
and ε_i ∈ {-1, 1} for each i = 1, …, ℓ. We may define the diameter diam(G) of a 
finite group to be the maximal diameter of the Cayley graphs Γ(G, A) for all 
generating sets A of G. 
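
For very small groups these definitions can be checked by brute force. The following is a minimal sketch, not part of the argument of the paper: it computes diam(Γ(G, A)) by breadth-first search from the identity. The encoding of permutations as tuples of images and the names `compose`, `inverse` and `cayley_diameter` are assumptions made purely for illustration.

```python
from collections import deque


def compose(g, h):
    """Product gh under the right-action convention: apply g first, then h."""
    return tuple(h[i] for i in g)


def inverse(g):
    """Inverse of a permutation given as a tuple of images."""
    inv = [0] * len(g)
    for point, image in enumerate(g):
        inv[image] = point
    return tuple(inv)


def cayley_diameter(generators):
    """Return (diameter, group order) for the undirected Cayley graph of <generators>.

    Cayley graphs are vertex-transitive, so the largest distance from the
    identity equals the diameter of the graph.
    """
    identity = tuple(range(len(generators[0])))
    steps = set(generators) | {inverse(a) for a in generators}
    dist = {identity: 0}
    queue = deque([identity])
    while queue:
        g = queue.popleft()
        for a in steps:
            ga = compose(g, a)  # the edge {g, ga}
            if ga not in dist:
                dist[ga] = dist[g] + 1
                queue.append(ga)
    return max(dist.values()), len(dist)


# Sym(4) generated by the adjacent transpositions (0 1), (1 2), (2 3).
adjacent = [(1, 0, 2, 3), (0, 2, 1, 3), (0, 1, 3, 2)]
print(cayley_diameter(adjacent))
```

Such a search visits every element of G, so it is only feasible for tiny degrees; diam(G) itself would further require a maximum over all generating sets.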

Much recent work on group diameters has been motivated by the following 
conjecture: 

Conjecture 1. (Babai, published as [BS92, Conj. 1.7]) For all finite simple groups 
G, 

diam(G) ≤ (log |G|)^{O(1)}, 
where the implied constant is absolute. 

Here and henceforth, |S| denotes the number of elements of a set S. 

The first class of finite simple groups for which Conj. 1 was established was 
PSL_2(Z/pZ) with p prime, by Helfgott [Hel08]. The paper [Hel08] initiated a period 
of intense activity [BG08a], [BG08b], [Din], [BGS10], [Hel11], [GHa], [Var], [BGS], 
[PS], [BGT], [GHb], [SGV]^1 on the diameter problem and the related problem of 
expansion properties of Cayley graphs. 

As far as work in this vein on the diameter of finite simple groups is concerned, the 
best results to date are those of Pyber, Szabó [PS] and Breuillard, Green, Tao [BGT]. 
Their wide-ranging generalisation covers all simple groups of Lie type, but (just like 
[GHa]) the diameter estimates retain a strong dependence on the rank; thus, they 
prove Conj. 1 only for groups of bounded rank. The problem for the alternating 
groups remained wide open.^2 

These two issues are arguably related: product theorems (of the type |A · A · A| ≫ 
|A|^{1+ε}, familiar since [Hel08]) are false both in the unbounded-rank case and in the 
case of alternating groups, and the counterexamples described in both situations in 
[PPSS], [PS] are based on similar principles. 

In the present paper we address the case of alternating (and symmetric) groups. 
We expect that some of the combinatorial difficulties we overcome will also arise in 
the context of linear groups of large rank. 

For G = Alt(n), Conj. 1 stipulates that diam(Alt(n)) = n^{O(1)}; [BS92] refers to 
this special case of Conj. 1 as a "folklore" conjecture. Indeed, this has long been a 
problem of interest in computer science (see [KMS84], [McK84], [BHK+90], [BBS04], 
[BH05]). On a more playful level, bounds on the diameter of permutation groups 
are relevant to every permutation puzzle (e.g., Rubik's cube). 

The best previously known upper bound on diam(G) for G = Alt(n) or G = 
Sym(n) was more than two decades old: 

(1.2) diam(G) ≤ exp((1 + o(1)) √(n log n)) = exp((1 + o(1)) √(log |G|)), 
due to Babai and Seress [BS88]. (We write exp(x) for e^x.) 

1.2. Statement of results. Recall that a function f(n) is called quasipolynomial if 
log(f(n)) is a polynomial function of log n. Our main result establishes a 
quasipolynomial upper bound for diam(Alt(n)) and diam(Sym(n)). 

Main Theorem. Let G = Sym(n) or Alt(n). Then 

diam(G) ≤ exp(O((log n)^4 log log n)), 

where the implied constant is absolute. 

The quasipolynomial bound extends to a much broader class of permutation 
groups. Recall that a permutation group G acting on a set Ω is called transitive if 

∀α, β ∈ Ω ∃g ∈ G such that g takes α to β. 

The size of the permutation domain is called the degree of G. 

^1 This list is not meant to be exhaustive. 
^2 See, e.g., I. Pak's remarks (made already before [PS], [BGT]) on the relative difficulty of the 
work remaining to do in the linear case (to be finished "in the next 10 years") and of the problem 
on Alt(n), for which there was "much less hope" [Pak]. 

Kornhauser et al. [KMS84] and McKenzie [McK84] raised the question of what classes 
of permutation groups may have a polynomial diameter bound in their degree. A 
weaker, quasipolynomial bound for all transitive groups was formally conjectured in 
[BS92]: 

Conjecture 2. ([BS92, Conj. 1.6]) If G is a transitive permutation group of degree 
n then diam(G) ≤ exp((log n)^{O(1)}). 

Babai and Seress [BS92] linked Conj. 2 to the diameter of alternating groups: 

Theorem 1.1. ([BS92, Thm. 1.4]) If G is a transitive permutation group of degree 
n then 

diam(G) ≤ exp(O(log n)^3) diam(Alt(k)), 

where Alt(k) is the largest alternating composition factor of G. 

Combining our Main Theorem with Thm. 1.1, we immediately obtain 

Corollary 1.2. Conjecture 2 is true; indeed the diameter of any transitive 
permutation group G of degree n is 

diam(G) ≤ exp(O((log n)^4 log log n)). 

We note that Thm. 1.1 is not only used to prove Cor. 1.2 - it is also an important 
ingredient in the proof of the Main Theorem (see Lemma 6.4). 

It is well-known that, for any finite group G and any set of generators A of G, 
the eigenvalues λ_0 ≥ λ_1 ≥ λ_2 ≥ … of the adjacency matrix of Γ(G, A) satisfy 

(1.3) λ_0 - λ_1 ≥ 1 / diam(Γ(G, A))^2. 

(See [DSC93, Cor. 1] or the references [Ald87], [Bab91], [Gan91], [Moh91] therein.) 
Because of (1.3), we obtain immediately that 

λ_0 - λ_1 ≥ exp(-O((log n)^4 log log n)), 

with consequences on expansion and the mixing rate (see, e.g., [Lov96]). 

Finally, the Main Theorem and Cor. 1.2 extend to directed graphs. Given G = 
⟨A⟩, the directed Cayley graph \vec{Γ}(G, A) is the graph with vertex set G and edge 
set {(g, ga) : g ∈ G, a ∈ A}. The diameter of \vec{Γ}(G, A) is defined by (1.1), where 
"path" should be read as "directed path"; \vec{diam}(G) is the maximum of 
diam(\vec{Γ}(G, A)) taken as A varies over all generating sets A of G. Thanks to 
Babai's bound \vec{diam}(G) = O(diam(G) · (log |G|)^2) [Bab06, Cor. 2.3], valid for all 
groups G, we obtain immediately from Cor. 1.2 that 

Corollary 1.3. Let G be a transitive group on n elements. Then 

\vec{diam}(G) ≤ exp(O((log n)^4 log log n)). 

1.3. General approach. An analogy underlies recent work on growth in groups: 
much of basic group theory carries over when, instead of subgroups, we study sets^3 
that grow slowly (|A · A · A| ≤ |A|^{1+ε}). This realisation is clearer in [Hel11] than in 
[Hel08], and has become current since then. (The term "approximate group" [Tao08] 
actually first arose in a different context, namely, the generalisation of some 
arguments in classical additive combinatorics to the non-abelian case. (See also 
[Hel08, §2.3], [SSV05, Lem. 4.2].) The analogy between subgroups and slowly growing 
sets was also explored in a model-theoretic setting in later work by Hrushovski [Hru].) 

^3 Or at least results on subgroups that rely on grosso modo quantitative arguments. (Crucially, 
the orbit-stabiliser theorem carries over (Lem. 3.1); Sylow theory, which is quantitative but relies on 
(necessarily delicate) congruences, does not.) As [BBS04, Lem. 2.1] (in retrospect) and Prop. 5.2 in 
the present work make clear, probabilistic arguments in combinatorics can also carry over, provided 
that the desired probability distribution on a set can be approximated quickly by the action of a 
random walk. 

This analogy is more important than whether one works with approximate 
subgroups in Helfgott's sense (|A · A · A| ≤ |A|^{1+ε}) or Tao's sense ([Tao08, Def. 3.7]); 
the two definitions are essentially equivalent, and we will actually work with 
neither. We could phrase part of our argument in terms of statements of the form 
|A^k| ≤ |A|^{1+ε}, but k would sometimes be larger than n; applying the tripling lemma 
([RT85], [Hel08, Lem. 2.2], [Tao08, Lem. 3.4]) to such statements would weaken 
them fatally. 

There is another issue worth emphasising: the study of growth needs to be relative. 
We should not think simply in terms of a group acting on itself by multiplication 
- even if, in the last analysis, this is the only operation available to us. Rather, 
growth statements often need to be thought of in terms of the action of a group G 
on a set X, and the effect of this action on subsets A ⊆ G, B ⊆ X. (Here X may or 
may not be endowed with a structure of its own.) This was already clear in [Hel11, 
Prop. 3.1] and [GHb], and is crucial here: a key step will involve the action of a 
normaliser N_G(H) on a subgroup H ≤ G by conjugation. 

Our debt to previous work on permutation groups is manifold. It is worthwhile to 
point out that some of our main techniques are adaptations for sets of 
classification-free arguments^4 on the properties of subgroups of Sym(n) by Babai 
[Bab82], Pyber [Pyb93], Bochert [Boc89], and Liebeck [Lie83]. Of particular importance 
is Babai and Pyber's work on the order of 2-transitive groups [Bab82], [Pyb93]. 

We shall also utilise previous diameter bounds. Besides Thm. 1.1, we shall use 
the main idea from [BS88] (see Lemma 3.20) and the following theorem by Babai, 
Beals, and Seress. For a permutation g of a set Ω, the support supp(g) is the subset 
of elements of Ω that are displaced by g. 

Theorem 1.4. ([BBS04]) For every ε < 1/3 there exists K(ε) such that, if G = 
Alt(n) or Sym(n) and A is a set of generators of G containing an element x ∈ A 
with 1 < |supp(x)| < εn, then 

diam(Γ(G, A)) ≤ K(ε) n^8. 
We will use this theorem repeatedly. 

We note that until recently Theorem 1.4 gave the largest known explicit class of 
Cayley graphs of Sym(n) or Alt(n) that has polynomially bounded diameter. In late 
2010, partly based on ideas from [BBS04], Bamberg et al. [BGH+] proved that if a 
set of generators of Sym(n) or Alt(n) contains an element of support size at most 
0.63n then the diameter of the Cayley graph is bounded by a polynomial of n. 

^4 Cf. the role of [LP11] (esp. Thm. 4.2, Thm. 6.2), which, in order to provide alternatives to 
the Classification of Finite Simple Groups, did (both more and less generally) for subgroups what 
[Hel11, §5] did for sets, and was later translated back to sets for use in [BGT]. 

1.4. Outline. We begin with some general results on growth in groups, presented 
in terms of actions (§3.1-§3.3). We then show that large subsets of Alt(n) generate 
a copy of Alt(Δ), |Δ| ≫ n, quickly (§3.4). This enables us to construct such a copy 
starting from a stabiliser chain with long orbits (§3.5). (A stabiliser chain is a nested 
sequence of pointwise stabilisers.) We also note that a group projecting onto Alt(Δ) 
(Δ not tiny) must contain an element of small support (§3.6). 

Section 4 uses random walks to show that, if we are given a set of generators A 
for Alt(n), we can construct a bounded-size set of generators that are short words 
on A (see the proposition and corollary proved there). 

Section 5 is of particular importance. It shows how to construct stabiliser chains 
with long orbits. The idea is to adapt Babai's splitting lemma [Bab82] from groups 
to sets. 

Section 6 contains the core of the proof and, in particular, the main procedure 
that gets iterated (Prop. 6.3). We are given a stabiliser chain with long orbits; the 
procedure extends the chain, whose number of elements m increases by 
≫ m(log m)/(log n) new elements. 

We now give a sketch of the procedure. (Some definitions are simplified.) We 
can assume the last set in the stabiliser chain - namely, a pointwise stabiliser 
B^- = […] - generates an alternating group, as otherwise we can descend to a smaller 
group and use induction (Lemma 6.4, based on §3.6 and Thm. 1.1). We have a small 
set of generators for ⟨B^-⟩ (by […]). We let the setwise stabiliser B^+ = A_{{α_1, …, α_m}} 
(which is large, by Lemma 3.19) act on these generators by conjugation; by an 
orbit-stabiliser principle, either (a) an element of B^+ centralises ⟨B^-⟩ (and thus has 
small support, and we are done by Thm. 1.4) or (b) one of the orbits is large. In the 
latter case, we get many new elements in the pointwise stabiliser of (α_1, …, α_m). 
We proceed to organise them in a stabiliser chain (by repeated applications of […]), 
thereby extending the chain we started with. 

1.5. Acknowledgements. We are deeply grateful to both Pablo Spiga and Nick 
Gill for stimulating discussions and for their constant help. Gordon Royle organised 
the first author's visit to Australia; if it were not for him, our collaboration might 
not have happened. Thanks are also due to László Babai, Martin Kassabov, Igor 
Pak, Peter Sarnak and Andrzej Żuk for their advice. 

Ákos Seress was supported in part by the NSF and by ARC Grant DP1096525. 
Travel was supported in part by H. A. Helfgott's Philip Leverhulme prize. We 
benefited from the kind hospitality of the University of Western Australia and the 
École Normale Supérieure during our visits to each other's institutions. 

2. Notation 

We write [n] = {1, 2, …, n}. For a set Ω, Sym(Ω) and Alt(Ω) are the symmetric 
and alternating groups acting on Ω. As is customary, we often write Alt(n) and 
Sym(n) for Alt([n]) and Sym([n]) - particularly when we are thinking of these groups 
as abstract groups as opposed to their actions. 

We write H ≤ G to mean that H is a subgroup of G and H ◁ G to mean that 
H is a normal subgroup. We say that a group S is a section of a group G if there 
exist subgroups H and K of G with K ◁ H and H/K ≅ S. We denote the identity 
element of a group by e. 

Let A be a subset of a group G. We write A^{-1} = {a^{-1} : a ∈ A}, A^k = {a_1 a_2 ⋯ a_k : 
a_1, …, a_k ∈ A}. In [Hel08], [Hel11], the first author wrote A_ℓ to mean (A ∪ A^{-1} ∪ 
{e})^ℓ; this does not seem to have become standard, and would also not do here due 
to the potential confusion with alternating groups. (Recall that A_n is in common 
usage as a synonym for Alt(n).) We will often include A = A^{-1}, e ∈ A explicitly 
in our assumptions so as to simplify notation. A set A with A = A^{-1} is said to be 
symmetric. 

We write \A\ for the number of elements of a set A. (All of our sets and groups 
are finite.) Given a group G and a subgroup H < G, we write [G : H] for the index 
of H in G. 

Let a group G act on a set X. As is customary in the study of permutation 
groups, given g ∈ G and α ∈ X, we write α^g for the image of α under the action 
of g. We speak of the orbit α^A = {α^g : g ∈ A} of a point α under the action of a 
set A of permutations. Our actions are right actions by default: (α^g)^h = α^{gh}. In 
consequence, we also use right cosets by default, i.e., cosets Hg (and so G/H is the 
set of all such cosets). Clearly |G/H| = [G : H]. 

We define the commutator [g, h] by [g, h] = g^{-1}h^{-1}gh. Again, this choice is 
customary for permutation groups. 

Define 

A_Σ = {g ∈ A : Σ^g = Σ} (the setwise stabiliser), 

A_{(Σ)} = {g ∈ A : ∀α ∈ Σ (α^g = α)} (the pointwise stabiliser). 

Given a permutation g ∈ Sym(Ω), we define its support supp(g) to be the set of 
elements of Ω moved by g: supp(g) = {α ∈ Ω : α^g ≠ α}. If a subset Δ ⊆ Ω is 
invariant under g, i.e., Δ is a union of cycles of g, then we define g|_Δ ∈ Sym(Δ) as 
the restriction (natural projection) of g to Δ: the permutation g|_Δ acts on Δ as g 
does. If Δ is invariant under some D ⊆ Sym(Ω) then D|_Δ = {g|_Δ : g ∈ D}. 
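
To make the notation concrete, here is a small sketch of supports and of the setwise and pointwise stabilisers of a set of permutations. It is an illustration only; the tuple encoding (the entry at index i is the image of i, points numbered from 0) and the function names are assumptions, not notation from the paper.

```python
from itertools import permutations


def support(g):
    """Points moved by g, where g is a tuple of images acting on 0..n-1."""
    return {point for point, image in enumerate(g) if image != point}


def setwise_stabiliser(A, sigma):
    """Elements of A that map the set sigma onto itself."""
    sigma = set(sigma)
    return {g for g in A if {g[point] for point in sigma} == sigma}


def pointwise_stabiliser(A, sigma):
    """Elements of A that fix every point of sigma."""
    return {g for g in A if all(g[point] == point for point in sigma)}


sym4 = set(permutations(range(4)))
print(len(setwise_stabiliser(sym4, {0, 1})), len(pointwise_stabiliser(sym4, {0, 1})))
print(support((1, 0, 2, 3)))
```

Note that both functions accept an arbitrary subset A of the symmetric group, not only a subgroup, which is the generality the definitions above are stated in.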

A partition B = {Ω_1, Ω_2, …, Ω_k} of a set Ω is called a system of imprimitivity for 
a transitive group G ≤ Sym(Ω) if G permutes the sets Ω_i, for 1 ≤ i ≤ k. A transitive 
group G ≤ Sym(Ω) is called primitive if there are only the two trivial systems of 
imprimitivity for G: the partition into one-element sets, and the partition consisting 
of one part Ω_1 = Ω. 

We say that a graph (or a multigraph) is regular with degree or valency d if every 
vertex is connected to d others; that is, "degree" and "valency" of a vertex mean the 
same thing. In a directed graph, the out-degree of a vertex x is the number of edges 
starting at x while the in-degree is the number of edges terminating at x. A directed 
graph is called strongly connected if for any two vertices x, y, there is a directed path 
from x to y. 

By f(n) ≪ g(n), g(n) ≫ f(n) and f(n) = O(g(n)) we mean one and the same 
thing, namely, that there are N > 0, C > 0 such that f(n) ≤ C · |g(n)| for all n ≥ N. 



3. Preliminaries on sets, groups and growth 

3.1. Orbits and stabilisers. The orbit-stabiliser theorem from elementary group 
theory carries over to sets. This is a fact whose importance to the area is difficult 
to overemphasise. It underlies already [Hel08] at a key point (Prop. 4.1); the action 
at stake there is that of a group G on itself by conjugation. 

The setting for the theorem is the action of a group G on a set X. The stabiliser 
G_x of a point x ∈ X is the set {g ∈ G : x^g = x}. Recall that we write x^g for g(x) 
and x^A for the orbit {g(x) : g ∈ A}. 

Lemma 3.1 (Orbit-stabiliser theorem for sets). Let G be a group acting on a set 
X. Let x ∈ X, and let A ⊆ G be non-empty. Then 

(3.1) |A A^{-1} ∩ G_x| ≥ |A| / |x^A|. 
Moreover, for every B ⊆ G, 

(3.2) |AB| ≥ |A ∩ G_x| |x^B|. 

The usual orbit-stabiliser theorem is the special case A = B = H, H a subgroup 
of G. 

Proof. By the pigeonhole principle, there exists an image x' ∈ x^A such that the 
set S = {a ∈ A : x^a = x'} has at least |A|/|x^A| elements. For any a, a' ∈ S, 
x^{a(a')^{-1}} = (x')^{(a')^{-1}} = x. Hence 

|A A^{-1} ∩ G_x| ≥ |S S^{-1}| ≥ |S| ≥ |A| / |x^A|. 

Let b_1, b_2, …, b_ℓ ∈ B, ℓ = |x^B|, be elements with x^{b_i} ≠ x^{b_j} for i ≠ j. Consider 
all products of the form a b_i, a ∈ A ∩ G_x, 1 ≤ i ≤ ℓ. If two such products a b_i, a' b_{i'} 
are equal, then x^{b_i} = x^{a b_i} = x^{a' b_{i'}} = x^{b_{i'}}. This implies b_i = b_{i'}. Since 
a b_i = a' b_{i'}, we conclude that a = a'. We have thus shown that all products a b_i, 
a ∈ A ∩ G_x, 1 ≤ i ≤ ℓ, are in fact distinct. Hence 

|AB| ≥ |(A ∩ G_x) · {b_i : 1 ≤ i ≤ ℓ}| 
= |A ∩ G_x| · ℓ = |A ∩ G_x| · |x^B|. 

□ 
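
As a sanity check, the two inequalities of the orbit-stabiliser theorem for sets, read as |A A^{-1} ∩ G_x| · |x^A| ≥ |A| and |AB| ≥ |A ∩ G_x| · |x^B|, can be tested on random subsets of Sym(5). The sketch below is illustrative only; the helper names, the tuple encoding of permutations and the choice of random trials are assumptions.

```python
import random
from itertools import permutations


def compose(g, h):
    """Product gh as a right action: apply g first, then h."""
    return tuple(h[i] for i in g)


def inverse(g):
    inv = [0] * len(g)
    for point, image in enumerate(g):
        inv[image] = point
    return tuple(inv)


def product_set(A, B):
    return {compose(a, b) for a in A for b in B}


def orbit(x, A):
    """The set x^A of images of x under the elements of A."""
    return {a[x] for a in A}


def stabiliser_part(A, x):
    """The intersection of A with the stabiliser G_x."""
    return {a for a in A if a[x] == x}


def lemma_3_1_holds(A, B, x):
    a_inverse = {inverse(a) for a in A}
    first = len(stabiliser_part(product_set(A, a_inverse), x)) * len(orbit(x, A)) >= len(A)
    second = len(product_set(A, B)) >= len(stabiliser_part(A, x)) * len(orbit(x, B))
    return first and second


rng = random.Random(0)
sym5 = list(permutations(range(5)))
trials = [
    (rng.sample(sym5, rng.randint(1, 30)), rng.sample(sym5, rng.randint(1, 30)), rng.randrange(5))
    for _ in range(200)
]
print(all(lemma_3_1_holds(set(A), set(B), x) for A, B, x in trials))
```

The sets A and B here are arbitrary subsets, neither symmetric nor closed under multiplication, which is exactly the setting in which the lemma replaces the classical theorem.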

As the following corollaries show, the relation between the size of A, on the one 
hand, and the size of orbits and stabilisers, on the other, implies that growth in the 
size of either orbits or stabilisers induces growth in the size of A itself. 

Corollary 3.2. Let G be a group acting on a set X. Let x ∈ X. Let A ⊆ G be a 
non-empty set with A = A^{-1}. Then, for any k > 0, 

(3.3) |A^{k+1}| ≥ (|A^k ∩ G_x| / |A^2 ∩ G_x|) |A|. 

Proof. By (3.2), 

|A^{k+1}| ≥ |A^k ∩ G_x| |x^A| = (|A^k ∩ G_x| / |A^2 ∩ G_x|) |A^2 ∩ G_x| |x^A|. 
Since |A^2 ∩ G_x| |x^A| ≥ |A| (by (3.1)), we obtain (3.3). □ 

Corollary 3.3. Let G be a group acting on a set X. Let x ∈ X. Let A ⊆ G be a 
non-empty set with A = A^{-1}. Then, for any k > 0, 

(3.4) |A^{k+2}| ≥ (|x^{A^k}| / |x^A|) |A|. 

Proof. By (3.2) and (3.1), |A^{k+2}| ≥ |A^2 ∩ G_x| |x^{A^k}| ≥ (|A| / |x^A|) |x^{A^k}|. □ 

3.2. Lemmas on subgroups and quotients. We start by recapitulating some of 
the simple material in [Hel11, §7.1]. The first lemma guarantees that we can always 
find many elements of AA^{-1} in any subgroup of small enough index. 

Lemma 3.4 ([Hel11, Lem. 7.2]). Let G be a group and H a subgroup thereof. Let 
A ⊆ G be a non-empty set. Then 

(3.5) |AA^{-1} ∩ H| ≥ |A| / r, 

where r is the number of cosets of H intersecting A. In particular, 

|AA^{-1} ∩ H| ≥ |A| / [G : H]. 

Proof. By the orbit-stabiliser principle (3.1) applied to the natural action of G on 
G/H by multiplication on the right. (Set x = He = H.) □ 

The following two lemmas should be read as follows: growth in a subgroup gives 
growth in the group; growth in a quotient gives growth in the group. 

Lemma 3.5 ([Hel11, Lem. 7.3]). Let G be a group and H a subgroup thereof. Let 
A ⊆ G be a non-empty set with A = A^{-1}. Then, for any k > 0, 

(3.6) |A^{k+1}| ≥ (|A^k ∩ H| / |A^2 ∩ H|) |A|. 

Proof. By Cor. 3.2 applied to the action of G on G/H by multiplication on the right 
(with x = He = H). □ 

For a group G and a subgroup H ≤ G, we define the coset map π_{G/H} : G → G/H 
that maps each g ∈ G to the right coset Hg containing g. 

Lemma 3.6. (essentially [Hel11, Lem. 7.3]) 

Let A ⊆ G be a non-empty set with A = A^{-1}. Then, for any k > 0, 

|A^{k+2}| ≥ (|π_{G/H}(A^k)| / |π_{G/H}(A)|) |A|. 

Proof. By Cor. 3.3 applied with G acting on X := G/H by multiplication on the 
right and with x := H seen as an element of G/H. □ 

^5 Recall that we are following the convention that G/H is the set of right cosets Hg. 

The following lemma is a generalisation of Lemma 3.4. 

Lemma 3.7. Let G be a group, let H, K be subgroups of G with H ≤ K, and let 
A ⊆ G be a non-empty set. Then 

|π_{K/H}(AA^{-1} ∩ K)| ≥ |π_{G/H}(A)| / |π_{G/K}(A)| ≥ |π_{G/H}(A)| / [G : K]. 

In other words: if A intersects r[G : H] cosets of H in G, then AA^{-1} intersects 
at least r[G : H]/[G : K] = r[K : H] cosets of H in K. (As usual, all our cosets are 
right cosets.) 

Proof. Since A intersects |π_{G/H}(A)| cosets of H in G and |π_{G/K}(A)| cosets of K in 
G, and every coset of K in G is a disjoint union of cosets of H in G, the pigeonhole 
principle implies that there exists a coset Kg of K such that A intersects at least 
k = |π_{G/H}(A)| / |π_{G/K}(A)| cosets Ha ⊆ Kg. Let a_1, …, a_k be elements of A in 
distinct cosets of H in Kg. Then a_i a_1^{-1} ∈ AA^{-1} ∩ K for each i = 1, …, k. Finally, 
note that H a_1 a_1^{-1}, …, H a_k a_1^{-1} are k distinct cosets of H. □ 

Lastly, a result of a somewhat different nature. It is a version of Schreier's lemma 
(rewritten slightly as in [GHb, Lem. 2.10]). 

Lemma 3.8 (Schreier). Let G be a group and H a subgroup thereof. Let A ⊆ G 
with A = A^{-1} and e ∈ A. Suppose A intersects each coset of H in G. Then A^3 ∩ H 
generates ⟨A⟩ ∩ H. Moreover, ⟨A⟩ = ⟨A^3 ∩ H⟩A. 

Proof. Let C ⊆ A be a full set of right coset representatives of H. We wish to 
show that ⟨A⟩ = ⟨A^3 ∩ H⟩C. (This immediately implies both ⟨A⟩ = ⟨A^3 ∩ H⟩A and 
⟨A⟩ ∩ H = ⟨A^3 ∩ H⟩.) 

Clearly e ∈ ⟨A^3 ∩ H⟩C. It is thus enough to show that, if g = hc, where h ∈ 
⟨A^3 ∩ H⟩ and c ∈ C, and a' ∈ A, then ga' still lies in ⟨A^3 ∩ H⟩C. This is easily 
seen: since C is a full set of coset representatives, there is a c' ∈ C with c' = h'ca' 
for some h' ∈ H, and thus 

ga' = hca' = h((h')^{-1})h'ca' = h((h')^{-1})c' ∈ ⟨A^3 ∩ H⟩⟨A^3 ∩ H⟩C = ⟨A^3 ∩ H⟩C, 

where we use the fact that h' = c'(a')^{-1}c^{-1} ∈ A^3. □ 
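
Schreier's lemma, read with the exponent 3 that the proof produces (A^3 ∩ H generates ⟨A⟩ ∩ H, and ⟨A⟩ = ⟨A^3 ∩ H⟩A), can be checked on a toy case. In the sketch below G = Sym(4), H is the stabiliser of the point 0, and A consists of the identity and the transpositions (0 1), (0 2), (0 3); this example and all function names are assumptions chosen for illustration.

```python
def compose(g, h):
    """Product gh as a right action: apply g first, then h."""
    return tuple(h[i] for i in g)


def product_set(A, B):
    return {compose(a, b) for a in A for b in B}


def power(A, k):
    """The set A^k of all products of k elements of A."""
    result = set(A)
    for _ in range(k - 1):
        result = product_set(result, A)
    return result


def generate(generators, n):
    """The subgroup of Sym(n) generated by the given permutations."""
    identity = tuple(range(n))
    group = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for a in generators:
            ga = compose(g, a)
            if ga not in group:
                group.add(ga)
                frontier.append(ga)
    return group


n = 4
e = tuple(range(n))
# A is symmetric, contains e, and meets every right coset of H = Sym(4)_0,
# because the images of 0 under A are 0, 1, 2 and 3.
A = {e, (1, 0, 2, 3), (2, 1, 0, 3), (3, 1, 2, 0)}
group = generate(A, n)
schreier_generators = {g for g in power(A, 3) if g[0] == 0}  # A^3 ∩ H
schreier = generate(schreier_generators, n)
print(len(group), len(schreier), product_set(schreier, A) == group)
```

Since the action is a right action, the right coset Hg is determined by the image of 0 under g, which is why the condition on cosets can be read off from the orbit of 0.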

3.3. Actions and generators. The proofs of the next two lemmas share a rather 
simple idea. 

Lemma 3.9. Let G be a group acting transitively on a finite set X. Let A ⊆ G with 
A = A^{-1} and G = ⟨A⟩. Then, for any x ∈ X, 

x^{A^ℓ} = X, 

where ℓ = |X|. 

Proof. Consider the orbits {x} ⊆ x^A ⊆ x^{A^2} ⊆ ⋯. Let ℓ' be the smallest integer 
with x^{A^{ℓ'+1}} = x^{A^{ℓ'}}. As x^{A^{ℓ'+2}} = (x^{A^{ℓ'+1}})^A = (x^{A^{ℓ'}})^A = x^{A^{ℓ'+1}} = x^{A^{ℓ'}}, 
we have x^{A^{ℓ'}} = x^{⟨A⟩} = x^G = X. Since 

{x} ⊊ x^A ⊊ x^{A^2} ⊊ … ⊊ x^{A^{ℓ'}} = X, 

we have ℓ' < |X|. □ 
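
The chain of orbits in this proof is easy to trace on an example. The sketch below is illustrative only and additionally assumes e ∈ A, which is what makes the sets x^{A^k} nested; the helper names and the choice of adjacent transpositions on six points are assumptions.

```python
def transposition(n, i, j):
    """The transposition (i j) on 0..n-1 as a tuple of images."""
    images = list(range(n))
    images[i], images[j] = j, i
    return tuple(images)


def orbit_chain(x, A):
    """Return [x^{A^0}, x^{A^1}, ...] up to the first set that stops growing.

    Assumes the identity lies in A, so that the sets are nested.
    """
    n = len(next(iter(A)))
    assert tuple(range(n)) in A, "this sketch assumes e in A"
    chain = [{x}]
    while True:
        following = {a[y] for y in chain[-1] for a in A}
        if following == chain[-1]:
            return chain
        chain.append(following)


n = 6
A = {tuple(range(n))} | {transposition(n, i, i + 1) for i in range(n - 1)}
chain = orbit_chain(0, A)
print(len(chain) - 1, chain[-1] == set(range(n)))
```

Here the orbit of the point 0 gains exactly one point per step, so the number of steps is |X| - 1, the worst case allowed by the strict chain in the proof.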

Lemma 3.10. Let G be a group acting transitively on a finite set X. Let A ⊆ G 
with A = A^{-1} and G = ⟨A⟩. Then there is a subset A' ⊆ A, |A'| < |X|, such that 
⟨A'⟩ acts transitively on X. 

Proof. Let x ∈ X. Let A_1 = {g}, where g is any element of A such that x^g ≠ x. For 
each i ≥ 1, let A_{i+1} be A_i ∪ {g_i}, where g_i is an element of A such that 
x^{⟨A_i ∪ {g_i}⟩} ⊋ x^{⟨A_i⟩}. If no such element g_i exists, we can conclude that x^{⟨A_i⟩} is 
taken to itself by every g ∈ A. This implies that x^{⟨A_i⟩} is taken to itself by every 
product of elements of A, and thus (x^{⟨A_i⟩})^{⟨A⟩} = x^{⟨A⟩} equals x^{⟨A_i⟩}. 
Hence, we have a chain 

{x} ⊊ x^{⟨A_1⟩} ⊊ x^{⟨A_2⟩} ⊊ … ⊊ x^{⟨A_ℓ⟩} = x^{⟨A⟩} = X. 

Clearly ℓ ≤ |X| - 1, and so |A_ℓ| ≤ |X| - 1. Let A' = A_ℓ. □ 
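
The proof is constructive, and the greedy selection it describes can be sketched as follows. This is an illustration under assumptions, not the authors' procedure verbatim: the function names are hypothetical, permutations are tuples of images, and the example takes A to be all transpositions of five points.

```python
from itertools import combinations


def orbit_under_group(x, generators):
    """Orbit of x under the group generated by `generators` (finite permutations)."""
    seen = {x}
    stack = [x]
    while stack:
        y = stack.pop()
        for g in generators:
            z = g[y]
            if z not in seen:
                seen.add(z)
                stack.append(z)
    return seen


def transitive_subset(A, x=0):
    """Greedily pick elements of A, each strictly enlarging the orbit of x."""
    chosen = []
    current = {x}
    grew = True
    while grew:
        grew = False
        for g in A:
            bigger = orbit_under_group(x, chosen + [g])
            if len(bigger) > len(current):
                chosen.append(g)
                current = bigger
                grew = True
                break
    return chosen, current


def transposition(n, i, j):
    images = list(range(n))
    images[i], images[j] = j, i
    return tuple(images)


all_transpositions = [transposition(5, i, j) for i, j in combinations(range(5), 2)]
chosen, orbit = transitive_subset(all_transpositions)
print(len(chosen), sorted(orbit))
```

Every chosen element enlarges the orbit of x by at least one point, so at most |X| - 1 elements are ever selected, in line with the bound |A'| < |X|.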

3.4. Large subsets of Sym(n). Let us first prove a result on large subgroups of 
Sym(n). 

Lemma 3.11. Let n ≥ 84. Let G ≤ Sym(n) be transitive, with a section isomorphic 
to Alt(k) for some k > n/2. Then G is either Alt(n) or Sym(n). 

Proof. Since k ≥ 5, the group Alt(k) is simple. Hence some composition factor of 
G has a section isomorphic to Alt(k). Assume that G is imprimitive and let B be 
a non-trivial system of imprimitivity for G. Write b = |B| and m = n/b and let K 
be the kernel of the action of G on B. Since G/K is isomorphic to a subgroup of 
Sym(b), K is isomorphic to a subgroup of Sym(m)^b and b, m < k, we obtain that G 
has no section isomorphic to Alt(k), a contradiction. This shows that G is primitive. 

From [PS80], we obtain that either G ≥ Alt(n) or |G| < 4^n. Since |G| ≥ 
|Alt(k)| = k!/2 ≥ ⌈n/2⌉!/2, a direct computation shows that the latter case arises 
only for n < 84. □ 

Our aim for the rest of this subsection will be to show that, if A ⊆ Sym(n) is 
very large, then A^{n^{O(1)}} contains a copy of Alt(Δ), |Δ| > n/2. The next lemma 
generalizes Bochert's theorem [Boc89], [DM96, Thm. 3.3B] to subsets. Recall that, 
for g ∈ Sym(n), we define the support of g by supp(g) = {α ∈ Ω : α^g ≠ α}. 

Lemma 3.12. Let n ≥ 5. Let A ⊆ Sym([n]) with A = A^{-1}. If ⟨A⟩ is a primitive 
permutation group and |A| > n!/(⌊n/2⌋!), then A^{n^4} is either Alt([n]) or Sym([n]). 

Proof. Given A ⊆ Sym([n]) as in the statement of the lemma, let k be the smallest 
integer such that there exists Δ ⊆ [n] with |Δ| = k and (A^2)_{(Δ)} = {e}. Let Δ be 
one such set. 
Suppose that k ≤ n/2. Then Sym([n])_{(Δ)} has n!/(n - k)! < |A| cosets in Sym([n]). 
Thus, by the pigeonhole principle, there exist two distinct elements a and b of A in 
the same coset. Hence ab^{-1} ∈ Sym([n])_{(Δ)}, that is, ab^{-1} ∈ (A^2)_{(Δ)}. This contradicts 
the definition of k. We conclude that k > n/2. 

The set Σ = [n] \ Δ has cardinality less than k, so by definition there exists 
g ∈ (A^2)_{(Σ)} with g ≠ e. Let δ ∈ Δ with δ^g ≠ δ. As the set Δ \ {δ} has cardinality 
less than k, by the definition of k, there exists h ∈ (A^2)_{(Δ\{δ})} with h ≠ e. Then 
supp(h) ⊆ Σ ∪ {δ}. Necessarily, δ ∈ supp(h), otherwise (A^2)_{(Δ)} contains the 
non-identity element h. Hence supp(g) ∩ supp(h) = {δ} and so the commutator 
x = [g, h] is a 3-cycle. Note that [g, h] ∈ A^8. 

Now, since ⟨A⟩ is primitive and contains a 3-cycle, by Jordan's theorem [DM96, 
Thm. 3.3A] we obtain that ⟨A⟩ ≥ Alt([n]). In particular, ⟨A⟩ is 3-transitive, and 
thus its action by conjugation on the set X of all 3-cycles is transitive. By Lemma 
3.9, 

x^{A^ℓ} = X, 

where ℓ = |X| = n(n - 1)(n - 2)/3 and A^ℓ acts on x by conjugation. Thus 

A^{n(n-1)(n-2)/3} [g, h] A^{n(n-1)(n-2)/3} 

contains all 3-cycles in Alt([n]). 

Since any element of Alt([n]) can be written as a product of at most ⌊n/2⌋ 3-cycles, 
we obtain that A^{n^4 - 1} contains Alt([n]). Also, if A contains an odd permutation, 
then A^{n^4} = Sym([n]). □ 
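
The step of this proof that produces a 3-cycle rests on a general fact: if the supports of g and h meet in exactly one point, then the commutator [g, h] = g^{-1}h^{-1}gh is a 3-cycle. A brute-force confirmation over all such pairs in Sym(5) is sketched below; the code is illustrative, and its helper names and tuple encoding are assumptions.

```python
from itertools import permutations


def compose(g, h):
    """Product gh as a right action: apply g first, then h."""
    return tuple(h[i] for i in g)


def inverse(g):
    inv = [0] * len(g)
    for point, image in enumerate(g):
        inv[image] = point
    return tuple(inv)


def support(g):
    return {point for point, image in enumerate(g) if image != point}


def commutator(g, h):
    """[g, h] = g^{-1} h^{-1} g h, with products read left to right."""
    return compose(compose(compose(inverse(g), inverse(h)), g), h)


sym5 = list(permutations(range(5)))
pairs = [(g, h) for g in sym5 for h in sym5 if len(support(g) & support(h)) == 1]
# A permutation moving exactly three points is necessarily a 3-cycle.
all_three_cycles = all(len(support(commutator(g, h))) == 3 for g, h in pairs)
print(len(pairs) > 0, all_three_cycles)
```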

What happens, however, if {A) is not transitive, let alone primitive? We shall see 
first that, if A is large, then {A) must have at least a large orbit. In the following 
two lemmas, we use the inequalities 

(3.7) (n/e)^n < n! < 3√n (n/e)^n, 

valid for all n ≥ 1. 
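
The two-sided factorial bound (3.7), read here as (n/e)^n < n! < 3√n (n/e)^n, can be spot-checked numerically. The sketch below compares logarithms so as to avoid overflow; the function name and the tested range are arbitrary choices for illustration, and a finite check is of course no substitute for a proof.

```python
import math


def factorial_sandwich_holds(n):
    """Check (n/e)^n < n! < 3 sqrt(n) (n/e)^n by comparing logarithms."""
    log_lower = n * (math.log(n) - 1.0)          # log of (n/e)^n
    log_factorial = math.lgamma(n + 1)           # log of n!
    log_upper = math.log(3.0) + 0.5 * math.log(n) + log_lower
    return log_lower < log_factorial < log_upper


print(all(factorial_sandwich_holds(n) for n in range(1, 10001)))
```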

Lemma 3.13. Let A ⊆ Sym(n) with |A| ≥ d^n n!, for some number d with 0.5 < d < 
1. If n is greater than a bound depending only on d, then ⟨A⟩ has an orbit of length 
at least dn. 

Proof. Let k := ⌊dn⌋. Suppose that |A| ≥ d^n n! and that the longest orbit length 
of ⟨A⟩ is less than dn. We shall prove that n is bounded from above by a function 
depending only on d, from which the lemma follows. We start by claiming that 
|⟨A⟩| ≤ k!(n - k)!. To prove the claim, we define a sequence of groups of 
non-decreasing order, starting with ⟨A⟩ and ending with Sym(k) × Sym(n - k). First, 
we replace ⟨A⟩ by a direct product of symmetric groups, acting on the same orbits 
as A. Then, if the length of the longest orbit O_1 is |O_1| < k, then we add a point 
α to O_1 from some other orbit O_2 and replace Sym(O_1) × Sym(O_2) by Sym(O_1 ∪ 
{α}) × Sym(O_2 \ {α}) in our direct product. Finally, if the largest orbit length is k 
then we merge all other orbits into one. 
Now, by (3.7), we have the following inequalities: 

d^n (n/e)^n < d^n n! ≤ |A| ≤ |⟨A⟩| ≤ k!(n - k)! 

(3.8) < 9 √(k(n - k)) (k/e)^k ((n - k)/e)^{n-k} ≤ (9/2) n k^k (n - k)^{n-k} / e^n. 

Simplifying the left-hand side together with the right-hand side (and using k ≤ dn), 
we obtain k^{n-k} < (9/2) n (n - k)^{n-k}, that is, (k/(n - k))^{n-k} < (9/2) n. 

We define c := (d/(1 - d))^{1-d}. As 

lim_{n→∞} (k/(n - k))^{(n-k)/n} = c > 1, 

for large enough n, depending only on d, we have (k/(n - k))^{n-k} > ((1 + c)/2)^n. 
However, ((1 + c)/2)^n < (9/2) n is false if n is greater than a bound depending only 
on d, proving our claim. □ 

Using Bochert's theorem [Boc89], Liebeck [Lie83] derived a result on large 
subgroups of Sym(n) that does not assume transitivity or primitivity; the following 
lemma is a generalisation to sets. Note that we use only elementary counting 
arguments, whereas [Lie83] is based on a detailed examination of the subgroup 
structure of Sym(n). 

Lemma 3.14. Let d be a number with 0.5 < d < 1. If A ⊆ Sym([n]) (with A = A^{-1}) 
has cardinality |A| ≥ d^n n! and n is larger than a bound depending only on d, then 
there exists an orbit Δ ⊆ [n] of ⟨A⟩ such that |Δ| ≥ dn and (A^{n^4})|_Δ is Alt(Δ) or 
Sym(Δ). 

Proof. By Lemma 3.13, for large enough n the group B = ⟨A⟩ has an orbit Δ of 
length k ≥ dn. Write ρ = k/n and note that d ≤ ρ ≤ 1. The group G = B|_Δ 
has order at least d^n n!/(n - k)!, so estimating k!(n - k)! from above as in (3.8) and 
estimating n! from below by (3.7), we obtain 

(3.9) [Sym(Δ) : G] ≤ k!(n - k)! / (d^n n!) ≤ (9/2) n k^k (n - k)^{n-k} / (d^n n^n) 
= (9/2) n (ρ^ρ (1 - ρ)^{1-ρ} / d)^n. 

Next, we show that for large values of n the transitive group G cannot be 
imprimitive. Indeed, if G is imprimitive, then using (3.7) we have 

(3.10) [Sym(Δ) : G] ≥ (1/2) binom(k, ⌊k/2⌋) ≥ 2^k / (18k) ≥ 2^{ρn} / (18n). 

A direct computation shows that the function f(ρ) = 2^{1/ρ} ρ (1 - ρ)^{(1-ρ)/ρ} is monotone 
increasing in the interval [1/2, 1) with supremum 2. Hence ρ^ρ (1 - ρ)^{1-ρ} = f(ρ)^ρ / 2 < 
2^ρ / 2, and, comparing the upper and lower bounds for [Sym(Δ) : G] deduced in (3.9) 
and (3.10), we obtain 

(3.11) (9/2) n 2^{ρn} (1/(2d))^n > (1/(18n)) 2^{ρn}. 

As d > 1/2, for large enough n we have (2d)^n > (18n)((9/2)n) and therefore (3.11) 
cannot hold. 

Hence G is primitive and A|_Δ is a set of size at least d^n n!/(n - k)! ≥ d^k k! > 
k!/(⌊k/2⌋)! (where the last inequality holds for n greater than a lower bound 
depending only on d). Therefore, by Lemma 3.12, (A|_Δ)^{k^4} is either Alt(Δ) or Sym(Δ), 
and hence so is (A^{n^4})|_Δ = (A|_Δ)^{n^4}. □ 



The following auxiliary lemma will allow us to strengthen the conclusions of 
Lemma 3.14. It shows that, if a set A contains a copy of Alt in a weak sense, 
A^{8n} contains a copy of Alt in a strong sense. 

Lemma 3.15. Let n ≥ 13. Let Δ ⊆ [n], |Δ| > n/2. Let A ⊆ Sym([n])_Δ, with 
A = A^{-1} and e ∈ A, be such that Δ is an orbit of ⟨A⟩ and A|_Δ is Alt(Δ) or 
Sym(Δ). Then (A^{8n})_{([n]\Δ)}|_Δ contains Alt(Δ). 

Proof. It is clear that |A| ≥ |Alt(Δ)| > |Sym([n] \ Δ)|. Thus, by the pigeonhole 
principle, there are h_1, h_2 ∈ A, h_1 ≠ h_2, such that h_1|_{[n]\Δ} = h_2|_{[n]\Δ}, and so 
g = h_1 h_2^{-1} fixes [n] \ Δ pointwise. 

We show that $(A^{14})_{([n] \setminus \Delta)}$ contains an element $g'$ such that $g'|_\Delta$ is a 3-cycle. If $g|_\Delta$ has at least two fixed points then there exists an element $h \in A$ so that $h|_\Delta$ is a 3-cycle, with $\mathrm{supp}(h|_\Delta)$ intersecting $\mathrm{supp}(g|_\Delta)$ in exactly one point. Then $g' = [g,h] \in A^{2+1+2+1} = A^6$ fixes $[n] \setminus \Delta$ pointwise and $g'|_\Delta$ is a 3-cycle. If g contains a cycle $(\alpha\beta\gamma\delta \dots)$ of length at least 4, then we choose an element $h \in A$ with $h|_\Delta = (\alpha\beta\gamma)$ and let $g' = [g,h] \in A^6$. Then $g'$ fixes $[n] \setminus \Delta$ pointwise and $g'|_\Delta$ is the 3-cycle $(\alpha\beta\delta)$.

In all other cases, $|\mathrm{supp}(g|_\Delta)| \geq |\Delta| - 1 \geq 6$ and all nontrivial cycles of g have length 2 or 3. Hence $g|_\Delta$ contains at least two 3-cycles, or at least two 2-cycles.
If sIa contains the cycles (a/37) and (Stju) then we choose an element h £ A with 
/i|a = {o':rj){j35^v). A little computation shows that g' = [g, h] fixes [n] \ A pointwise 
and ^'Ia is the 3-cycle {STju). 

Finally, suppose g contains the 2-cycles $(\alpha\beta)$ and $(\gamma\delta)$. We choose again an element $h \in A$ with $h|_\Delta = (\alpha\beta\gamma)$; then $\mathrm{supp}([g,h]) = \{\alpha,\beta,\gamma,\delta\}$ and $[g,h]$ fixes $[n] \setminus \Delta$ pointwise. Since $[g,h] \in A^6$ also fixes at least two points of $\Delta$, we deduce as in the very first case of our analysis that the commutator $g' = [[g,h],h']$ with an appropriate $h' \in A$ is a 3-cycle. Note that $g' \in A^{6+1+6+1} = A^{14}$.

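Given any 3-cycle s in $\mathrm{Sym}(\Delta)$, we can conjugate $g'$ by an appropriate element of A to get an element of $(A^{16})_{([n] \setminus \Delta)}$ whose restriction to $\Delta$ equals s. Now, every element of $\mathrm{Alt}(\Delta)$ is the product of at most $\lfloor |\Delta|/2 \rfloor$ 3-cycles. Hence $(A^{16 \lfloor n/2 \rfloor})_{([n] \setminus \Delta)}|_\Delta$ contains $\mathrm{Alt}(\Delta)$. □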
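The commutator computations used in this proof can be checked mechanically. The following Python sketch is only an illustration and not part of the original argument: it assumes the right-action convention (apply the left factor first) and the commutator convention $[g,h] = g^{-1}h^{-1}gh$, it encodes $\alpha, \beta, \gamma, \delta$ as the points 0, 1, 2, 3, and the helper names (compose, inverse, from_cycles, commutator) are hypothetical.

```python
def compose(p, q):
    """Product pq under the right-action convention: apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))


def inverse(p):
    inv = [0] * len(p)
    for point, image in enumerate(p):
        inv[image] = point
    return tuple(inv)


def from_cycles(n, cycles):
    """Permutation of {0, ..., n-1} given as a list of disjoint cycles (tuples)."""
    perm = list(range(n))
    for cyc in cycles:
        for a, b in zip(cyc, cyc[1:] + cyc[:1]):
            perm[a] = b
    return tuple(perm)


def commutator(g, h):
    """[g, h] = g^{-1} h^{-1} g h (an assumed convention)."""
    return compose(compose(inverse(g), inverse(h)), compose(g, h))


# Case 1: g fixes the points 5 and 6; h is a 3-cycle meeting supp(g) only in 0.
case_fixed_points = commutator(from_cycles(7, [(0, 1, 2), (3, 4)]),
                               from_cycles(7, [(0, 5, 6)]))
# Case 2: g contains a cycle of length at least 4; h = (alpha beta gamma).
case_long_cycle = commutator(from_cycles(5, [(0, 1, 2, 3, 4)]),
                             from_cycles(5, [(0, 1, 2)]))
# Case 3: g contains the 2-cycles (alpha beta)(gamma delta); h = (alpha beta gamma).
case_two_swaps = commutator(from_cycles(4, [(0, 1), (2, 3)]),
                            from_cycles(4, [(0, 1, 2)]))
```

Under these conventions the first two commutators are 3-cycles, and the third is a product of two transpositions supported on $\{\alpha,\beta,\gamma,\delta\}$, which is why a second commutator is needed in the last case.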
We finally obtain the statement we need on large sets A: a power of A (namely, $A^{8n^5}$) contains elements acting as the full alternating group on an orbit $\Delta$, and leaving all points outside $\Delta$ fixed.

Proposition 3.16. Let d be a number with $0.5 < d < 1$. Let $A \subseteq \mathrm{Sym}(n)$ with $A = A^{-1}$ and $e \in A$. If $|A| \geq d^n n!$ and n is larger than a bound depending only on d, then there exists an orbit $\Delta \subseteq [n]$ of $\langle A \rangle$ such that $|\Delta| \geq dn$ and $(A^{8n^5})_{([n] \setminus \Delta)}|_\Delta$ contains $\mathrm{Alt}(\Delta)$.

Proof. Immediate from Lemma 3.14 and Lemma 3.15, where the latter is applied with $(A^{n^4})_\Delta$ instead of A. □

3.5. Bases and stabiliser chains. Given a permutation group G on a set $\Omega$, a subset $\Sigma$ of $\Omega$ is called a base if $G_{(\Sigma)} = \{e\}$. This definition goes back to Sims [Sim70]. If, instead of G, we consider a subset A of $\mathrm{Sym}(\Omega)$, then, as the following lemma suggests, it makes sense to see whether $(AA^{-1})_{(\Sigma)}$ (rather than $A_{(\Sigma)}$) equals $\{e\}$.

Lemma 3.17. Let $A \subseteq \mathrm{Sym}(\Omega)$, $|\Omega| = n$. If $\Sigma \subseteq \Omega$ is such that $(AA^{-1})_{(\Sigma)} = \{e\}$, then $|\Sigma| \geq \log_n |A|$.

Proof. Notice first that $[\mathrm{Sym}(\Omega) : (\mathrm{Sym}(\Omega))_{(\Sigma)}] \leq n^{|\Sigma|}$. By the pigeonhole principle, if $|A| > n^{|\Sigma|}$, then there exists a right coset of $(\mathrm{Sym}(\Omega))_{(\Sigma)}$ containing more than one element of A, and thus

$|(AA^{-1})_{(\Sigma)}| = |AA^{-1} \cap (\mathrm{Sym}(\Omega))_{(\Sigma)}| > 1$.

Hence, if $(AA^{-1})_{(\Sigma)} = \{e\}$, then we have $|A| \leq n^{|\Sigma|}$, i.e., $|\Sigma| \geq \log_n |A|$. □
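As a sanity check of Lemma 3.17 on a very small case, the following Python sketch finds by brute force the least size of a set $\Sigma$ with $(AA^{-1})_{(\Sigma)} = \{e\}$ and compares it with $\log_n |A|$. It is an illustration only: permutations are tuples acting on $\{0,\dots,n-1\}$ from the right, and the function names are hypothetical.

```python
from itertools import combinations, permutations


def compose(p, q):
    """Product pq under the right-action convention: apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))


def inverse(p):
    inv = [0] * len(p)
    for point, image in enumerate(p):
        inv[image] = point
    return tuple(inv)


def pointwise_stabiliser(elements, points):
    """Elements of the given set that fix every point of `points`."""
    return {p for p in elements if all(p[x] == x for x in points)}


def smallest_base_size(A, n):
    """Least |Sigma| such that (A A^{-1})_(Sigma) = {e}, found by exhaustive search."""
    quotients = {compose(a, inverse(b)) for a in A for b in A}
    identity = tuple(range(n))
    for size in range(n + 1):
        for sigma in combinations(range(n), size):
            if pointwise_stabiliser(quotients, sigma) == {identity}:
                return size


sym4 = list(permutations(range(4)))
base_size = smallest_base_size(sym4, 4)  # the lemma predicts 4 ** base_size >= 24
```

For $A = \mathrm{Sym}(4)$ the lemma forces $|\Sigma| \geq \log_4 24 > 2$, in line with the familiar fact that a base of $\mathrm{Sym}(4)$ needs three points.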

The use of stabiliser chains $H > H_{(\alpha_1)} > H_{(\alpha_1,\alpha_2)} > \cdots$ is very common in computational group theory (starting, again, with the work of Sims; see references in [Ser03, §4.1]). We may study a similar chain $A \supseteq A_{(\alpha_1)} \supseteq A_{(\alpha_1,\alpha_2)} \supseteq \cdots$ when A is merely a set.

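Lemma 3.18. Let $\Sigma = \{\alpha_1, \dots, \alpha_m\} \subseteq [n]$ and $A \subseteq \mathrm{Sym}([n])$. Suppose that

$|\alpha_i^{A_{(\alpha_1,\dots,\alpha_{i-1})}}| \geq r_i$

for all $i = 1,2,\dots,m$. Then $A^m$ intersects at least $\prod_{i=1}^m r_i$ cosets of $\mathrm{Sym}([n])_{(\Sigma)}$.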

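Proof. For each $1 \leq i \leq m$, write $\Delta_i = \alpha_i^{A_{(\alpha_1,\dots,\alpha_{i-1})}}$; thus $|\Delta_i| \geq r_i$. For each $\delta \in \Delta_i$, pick $g_\delta \in A_{(\alpha_1,\dots,\alpha_{i-1})}$ with $\alpha_i^{g_\delta} = \delta$ and write $S_i = \{g_\delta : \delta \in \Delta_i\}$. Clearly, $|S_i| = |\Delta_i|$ and $S_i \subseteq A$. We show that for every two distinct tuples

$(s_1, s_2, \dots, s_m), (s'_1, s'_2, \dots, s'_m) \in S_1 \times \cdots \times S_m$

the products $P = s_m s_{m-1} \cdots s_1$ and $P' = s'_m s'_{m-1} \cdots s'_1$ belong to two distinct cosets of $\mathrm{Sym}([n])_{(\Sigma)}$. From this it follows that $A^m$ intersects at least $|S_1| \cdots |S_m| = |\Delta_1| \cdots |\Delta_m| \geq \prod_{j=1}^m r_j$ cosets of $\mathrm{Sym}([n])_{(\Sigma)}$.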

We argue by contradiction, that is, we assume that P and $P'$ map $(\alpha_1, \dots, \alpha_m)$ to the same m-tuple. Let j be the smallest index such that $s_j \neq s'_j$. Then $Q = P s_1^{-1} \cdots s_{j-1}^{-1}$ and $Q' = P' s_1^{-1} \cdots s_{j-1}^{-1} = P' (s'_1)^{-1} \cdots (s'_{j-1})^{-1}$ also map $(\alpha_1, \dots, \alpha_m)$ to the same m-tuple. Note that for all $k \leq m$, $s_k$ and $s'_k$ fix $(\alpha_1, \dots, \alpha_{k-1})$ pointwise. Thus

$\alpha_j^{s_j} = \alpha_j^{Q} = \alpha_j^{Q'} = \alpha_j^{s'_j}$,

which forces $s_j = s'_j$ (distinct elements of $S_j$ send $\alpha_j$ to distinct points), contradicting our assumption. □

We thus see that, if we choose $\alpha_1, \alpha_2, \dots$ so that the orbits $\alpha_i^{A_{(\alpha_1,\dots,\alpha_{i-1})}}$ are large, we get to occupy many cosets of $(\mathrm{Sym}([n]))_{(\Sigma)}$. By Lemma 3.7, this will enable us to occupy many cosets of $(\mathrm{Sym}([n]))_{(\Sigma)}$ in the setwise stabiliser $(\mathrm{Sym}([n]))_\Sigma$. We will then be able to apply Prop. 3.16 to build a large alternating group within $\mathrm{Sym}(\Sigma) = (\mathrm{Sym}([n]))_\Sigma / (\mathrm{Sym}([n]))_{(\Sigma)}$. This procedure is already implicit in [Pyb93, Lem. 3]; indeed, what amounts to this is signalled by Pyber as the main new element in his refinement [Pyb93, Thm. A] of Babai's theorem on the order of doubly transitive groups [Bab82]. The main difference is that we have to work, of course, with sets rather than groups; we also obtain a somewhat stronger conclusion due to our using Prop. 3.16 rather than invoking Liebeck's lemma directly.

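Lemma 3.19. Let $A \subseteq \mathrm{Sym}([n])$ with $A = A^{-1}$ and $e \in A$. Let $\Sigma = \{\alpha_1, \dots, \alpha_m\} \subseteq [n]$ be such that

(3.12) $|\alpha_i^{A_{(\alpha_1,\dots,\alpha_{i-1})}}| \geq dn$

for all $i = 1,2,\dots,m$, where $d > 0.5$. Then, provided that m is larger than a bound $C(d)$ depending only on d, there exists $\Delta \subseteq \Sigma$ with $|\Delta| \geq d|\Sigma|$ and

$\mathrm{Alt}(\Delta) \subseteq ((A^{16m^6})_\Sigma)_{(\Sigma \setminus \Delta)}|_\Delta$.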

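Proof. By (3.12) and Lemma 3.18, $A^m$ intersects at least $(dn)^m$ cosets of $\mathrm{Sym}([n])_{(\Sigma)}$ in $\mathrm{Sym}([n])$. Since

$[\mathrm{Sym}([n]) : \mathrm{Sym}([n])_\Sigma] = \frac{[\mathrm{Sym}([n]) : \mathrm{Sym}([n])_{(\Sigma)}]}{[\mathrm{Sym}([n])_\Sigma : \mathrm{Sym}([n])_{(\Sigma)}]} \leq \frac{n^m}{m!}$,

Lemma 3.7 implies (with $G = \mathrm{Sym}([n])$, $K = \mathrm{Sym}([n])_\Sigma$, $H = \mathrm{Sym}([n])_{(\Sigma)}$, and $A^m$ instead of A) that

$|\pi_{K/H}(A^{2m} \cap K)| \geq \frac{(dn)^m}{n^m/m!} = d^m m!$.

Note that $|\pi_{K/H}(A^{2m} \cap K)| = |(A^{2m})_\Sigma|_\Sigma|$. We can thus apply Prop. 3.16 (with m instead of n, and $A' = (A^{2m})_\Sigma|_\Sigma$ instead of A) and obtain that there is a set $\Delta \subseteq \Sigma$ such that $|\Delta| \geq dm$ and $((A')^{8m^5})_{(\Sigma \setminus \Delta)}|_\Delta$ contains $\mathrm{Alt}(\Delta)$. □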

3.6. Existence of elements of small support. The following lemma is essentially [BS87, Lemma 3] (or [BS88, Lemma 1]).

Lemma 3.20. Let $\Delta \subseteq [n]$, $|\Delta| \geq c(\log n)^2$, $c > 0$. Let $H \leq (\mathrm{Sym}(n))_\Delta$. Assume $H|_\Delta$ is $\mathrm{Alt}(\Delta)$ or $\mathrm{Sym}(\Delta)$.
Let $\Gamma$ be any orbit of H. Then, if n is larger than a bound depending only on c, H contains an element g with $g|_\Delta \neq 1$ and $|\mathrm{supp}(g|_\Gamma)| < |\Gamma|/4$.
Proof. Let $p_1 = 2, p_2 = 3, \dots, p_k$ be the sequence of the first k primes, where k is the least integer such that $p_1 p_2 \cdots p_k > n^4$. Much as in [BS87], we remark that, by elementary bounds towards the prime number theorem,

(3.13) $2p_1 + p_2 + \cdots + p_k < c(\log n)^2$,

provided that n be larger than a bound depending only on c. Thus H contains an element h such that $h|_\Delta$ consists of $|\Delta| - (2p_1 + p_2 + \cdots + p_k)$ fixed points and cycles of length $p_1, p_1, p_2, p_3, \dots, p_k$. (We need two cycles of length $p_1 = 2$ because we want an even permutation on $\Delta$.)

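We can now reason as in [BS87, Lemma 3] or [BS88, Lemma 1]. For every $\gamma \in \Gamma$, denote by $\kappa_\gamma$ the length (possibly 1) of the cycle of h containing $\gamma$ and for $i \leq k$ define $\Gamma_i := \{\gamma \in \Gamma : p_i \mid \kappa_\gamma\}$. Then

(3.14) $\sum_{\gamma \in \Gamma} \sum_{p_i \mid \kappa_\gamma} \log p_i < |\Gamma| \log n$

because $\kappa_\gamma < n$ implies that for all $\gamma$ the inner sum is less than $\log n$. Exchanging the order of summation,

$\sum_{\gamma \in \Gamma} \sum_{p_i \mid \kappa_\gamma} \log p_i = \sum_{i=1}^{k} |\Gamma_i| \log p_i$.

If $|\Gamma_i| \geq |\Gamma|/4$ for all $i \leq k$ then

$\sum_{i=1}^{k} |\Gamma_i| \log p_i \geq \frac{|\Gamma|}{4} \log\Big(\prod_{i=1}^{k} p_i\Big) > \frac{|\Gamma|}{4} \log(n^4) = |\Gamma| \log n$,

contradicting (3.14). Hence there is a prime $p \leq p_k$ such that $p \mid \kappa_\gamma$ for fewer than $|\Gamma|/4$ elements $\gamma$ of $\Gamma$. Denoting the order of h by $|h|$, we define $g = h^\ell$ for $\ell := |h|/p$. We obtain that $|\mathrm{supp}(g|_\Gamma)| < |\Gamma|/4$. We also have that g is non-trivial, since $g|_\Delta$ contains a p-cycle. Clearly $g \in H$, and so we are done.* □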

4. Random walks and generation 

4.1. Random walks. Let $\Gamma$ be a strongly connected directed multigraph with vertex set $V = V(\Gamma)$. For $x \in V(\Gamma)$, we denote by $\Gamma(x)$ the multiset of endpoints of the edges starting at x (counted with multiplicities in case of multiple edges). We are interested in the special case when $\Gamma$ is regular of valency d (i.e., $|\Gamma(x)| = d$, for each $x \in V(\Gamma)$) and $\Gamma$ is also symmetric in the sense that for all vertices $x, y \in V(\Gamma)$, the number of edges connecting x to y is the same as the number of edges connecting y to x. These two conditions imply that the adjacency matrix A of $\Gamma$ is symmetric and all row and column sums are equal to d.

A lazy random walk on $\Gamma$ is a stochastic process where a particle moves from vertex to vertex; if the particle is at vertex x such that $\Gamma(x) = \{y_1, \dots, y_d\}$, then the particle

• stays at x with probability $1/2$;

• moves to vertex $y_i$ with probability $1/(2d)$, for all $i = 1, \dots, d$.

* (Footnote to the proof of Lemma 3.20.) Since we need only the existence of g for the moment, we are not concerned by the fact that $\ell$ is very large. Compare this to the situation in [BS88], where the use of a large $\ell$ causes diameter bounds much weaker than those in the present paper.

Here we are concerned with the asymptotic rate of convergence for the probability distribution of a particle in a lazy random walk on $\Gamma$. For $x, y \in V(\Gamma)$, write $p_k(x,y)$ for the probability that the particle is at vertex y after k steps of a lazy random walk starting at x. For a fixed $\varepsilon > 0$, the mixing time for $\varepsilon$ is the minimum value of k such that

$\frac{1}{|V(\Gamma)|}(1 - \varepsilon) \leq p_k(x,y) \leq \frac{1}{|V(\Gamma)|}(1 + \varepsilon)$

for all $x, y \in V(\Gamma)$.

We can give a crude (and well-known; see, e.g., [BBS04, Fact 2.1]) upper bound on the mixing time for regular symmetric multigraphs in terms of $N = |V(\Gamma)|$, $\varepsilon$ and the valency d alone.

Lemma 4.1. Let $\Gamma$ be a connected regular multigraph of valency d and with N vertices. Then the mixing time for $\varepsilon$ is at most $N^2 d \log(N/\varepsilon)$.

Proof. Let A be the adjacency matrix of $\Gamma$. Since A is symmetric, the eigenvalues of A are real; moreover, their absolute value is clearly at most d. Let

$d = \mu_1 \geq \mu_2 \geq \cdots \geq \mu_N \geq -d$

be the eigenvalues of A and write $P = I/2 + A/2d$, where I is the $N \times N$ identity matrix. The matrix P is the probability transition matrix for the Markov process described by a lazy random walk on $\Gamma$.

The sum of every row or column of P is 1, i.e., P is a doubly stochastic matrix. The eigenvalues of P are

$1 = \lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_N \geq 0$

with $\lambda_i = 1/2 + \mu_i/2d$ for each $i = 1, \dots, N$. It is well-known that the asymptotic rate of convergence to the uniform distribution of a lazy random walk is determined by $\lambda_2$ (see for example [Lov96]): for each vertex y of $\Gamma$ and for each $k \geq 1$, we obtain from [Lov96, Theorem 5.1] that $|p_k(x,y) - 1/N| \leq \lambda_2^k$.
By [Fie72, Lemma 2.4 and Theorem 3.4], we have

$\lambda_2 \leq 1 - 2(1 - \cos(\pi/N))\mu(P)$,

where $\mu(P) = \min_{\emptyset \neq M \subsetneq \{1,\dots,N\}} \sum_{i \in M, j \notin M} P_{ij}$. As $\Gamma$ is a connected regular graph of valency d, we have $\mu(P) \geq 1/2d$. Using the Taylor series for $\cos(x)$, we see that $(1 - \cos(\pi/N)) \geq 1/N^2$. Thence $|p_k(x,y) - 1/N| \leq (1 - 1/(N^2 d))^k$. Since the sequence $\{(1 - 1/n)^n\}_{n \geq 1}$ is monotone increasing converging to $e^{-1}$, for $k \geq N^2 d \log(N/\varepsilon)$, we obtain that $|p_k(x,y) - 1/N| \leq \varepsilon/N$, as desired. □
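The bound of Lemma 4.1 is easy to test numerically on a small example. The sketch below (an illustration only, with hypothetical function names) runs the lazy random walk on the 5-cycle, viewed as a symmetric regular multigraph of valency 2, for $\lceil N^2 d \log(N/\varepsilon) \rceil$ steps and records the resulting distribution.

```python
import math


def lazy_walk_distribution(neighbours, start, steps):
    """Distribution after `steps` lazy steps: stay with probability 1/2,
    otherwise move along one of the d outgoing edges chosen uniformly."""
    n = len(neighbours)
    d = len(neighbours[0])
    dist = [0.0] * n
    dist[start] = 1.0
    for _ in range(steps):
        new = [0.5 * p for p in dist]
        for x, p in enumerate(dist):
            for y in neighbours[x]:
                new[y] += p / (2 * d)
        dist = new
    return dist


def mixing_bound(N, d, eps):
    """The upper bound N^2 d log(N/eps) of Lemma 4.1, rounded up."""
    return math.ceil(N * N * d * math.log(N / eps))


N, d, eps = 5, 2, 0.1
cycle = [[(x - 1) % N, (x + 1) % N] for x in range(N)]
k = mixing_bound(N, d, eps)
dist = lazy_walk_distribution(cycle, 0, k)
```

As the lemma describes it, the bound is crude: on this example the walk is already far closer to uniform than $\varepsilon/N$ well before step k.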

We will generally study regular symmetric multigraphs of the following type. Let G be a group and A be a subset of G with $A = A^{-1}$ and $e \in A$. Let G act on a set X. We take the elements of X as the vertices of our multigraph, and draw one edge from $x \in X$ to $x' \in X$ for every $a \in A$ such that $x^a = x'$. A walk on the graph then corresponds to the action of an element of $A^\ell$ on an element x of X, where $\ell$ is the length of the walk and x is the starting point of the walk.
Lemma 4.1 then gives us a lower bound on how large $\ell$ has to be for the action of $A^\ell$ on X to have a rather strong randomising effect. This idea was already exploited in [BBS04, §2], with $G = \mathrm{Sym}([n])$ and X the set of k-tuples of [n].

Lemma 4.2. Let H be a k-transitive subgroup of $\mathrm{Sym}([n])$. Let A be a set of generators of H with $A = A^{-1}$ and $e \in A$. Then there is a subset $A' \subseteq A$ with $A' = (A')^{-1}$, such that, for every $\varepsilon > 0$, for any $\ell \geq 2n^{3k} \log(n^k/\varepsilon)$, and for any k-tuples $x = (x_1, \dots, x_k)$, $y = (y_1, \dots, y_k)$ of distinct elements of [n], the probability of the event

$x^{g_1 g_2 \cdots g_\ell} = y$

for $g_1, \dots, g_\ell \in A'$ (chosen independently, with uniform distribution on $A' \setminus \{e\}$ and with the identity being assigned probability $1/2$) is at least $(1 - \varepsilon)\frac{(n-k)!}{n!}$ and at most $(1 + \varepsilon)\frac{(n-k)!}{n!}$.



Proof. Let $\Delta$ be the set of k-tuples of distinct elements of [n]. Since H acts transitively on $\Delta$ and since $\langle A \rangle = H$, Lemma 3.10 gives us a subset $A'$ of A with $\langle A' \rangle$ transitive on $\Delta$ and with $|A'| < |\Delta|$. Set $A_0 = A' \cup A'^{-1}$. Let $\Gamma$ be the multigraph with vertex set $\Delta$ and with $\Gamma(x) = \{x^a \mid a \in A_0\}$ as the multiset of neighbours of x for each $x \in \Delta$. Clearly, $\Gamma$ is a regular graph of valency $|A_0| \leq 2|\Delta|$ and with $|\Delta| \leq n^k$ vertices. Now the statement follows from Lemma 4.1 applied to $\Gamma$. □

4.2. Generators. Given $A \subseteq \mathrm{Sym}([n])$ such that $\langle A \rangle$ is $\mathrm{Alt}([n])$ or $\mathrm{Sym}([n])$, how long can it take to construct a small set of generators for a transitive subgroup of $\langle A \rangle$? This subsection is devoted to answering that question. We start with proving two auxiliary lemmas.

Lemma 4.3. Let $A \subseteq \mathrm{Sym}([n])$, $e \in A$. Assume $\langle A \rangle$ is transitive. Then there is a $g \in A^n$ such that $|\mathrm{supp}(g)| \geq n/2$.

Proof. For each $i \in [n]$, let $g_i$ be an element of A moving i. (If no such element existed, then $\langle A \rangle$ could not be transitive.) Let $g = g_1^{r_1} g_2^{r_2} \cdots g_n^{r_n}$, where $r_1, r_2, \dots, r_n \in \{0,1\}$ are independent random variables taking the values 0 and 1 with equal probability. (Such an element g is called a random subproduct of the sequence $(g_i)$; see, e.g., [Ser03, §2.3] for other applications.)

Let $\alpha \in [n]$ be arbitrary. Let j be the largest integer such that $g_j$ moves $\alpha$. Then g moves $\alpha$ if and only if $g' = g_1^{r_1} \cdots g_j^{r_j}$ moves $\alpha$. Take $r_1, r_2, \dots, r_{j-1}$ as given. If $\beta = \alpha^{g_1^{r_1} \cdots g_{j-1}^{r_{j-1}}}$ equals $\alpha$, then $g'$ moves $\alpha$ if and only if $r_j = 1$; this happens with probability 1/2. If $\beta \neq \alpha$, then $g'$ certainly moves $\alpha$ if $r_j = 0$, and thus moves $\alpha$ with probability at least 1/2. Thus g moves $\alpha$ with probability at least 1/2.

Summing over all $\alpha$, we see that the expected value of the number of elements of [n] moved by g is at least n/2. In particular, there is a $g \in A^n$ moving at least n/2 elements of [n]. □
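The random-subproduct argument can be checked exactly for small n by enumerating all $2^n$ choices of $(r_1, \dots, r_n)$. The following Python sketch is an illustration under assumptions of its own: points are numbered $0, \dots, n-1$, permutations act from the right, and the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def compose(p, q):
    """Product pq under the right-action convention: apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))


def support_size(p):
    return sum(1 for point, image in enumerate(p) if image != point)


def transposition(n, a, b):
    perm = list(range(n))
    perm[a], perm[b] = b, a
    return tuple(perm)


def subproduct_statistics(gens):
    """gens[i] must move the point i. Returns the exact mean and the maximum of
    |supp(g)| over all 2^n subproducts g = gens[0]^r_0 ... gens[n-1]^r_{n-1}."""
    n = len(gens)
    total, best = 0, 0
    for bits in product((0, 1), repeat=n):
        g = tuple(range(n))
        for r, gen in zip(bits, gens):
            if r:
                g = compose(g, gen)
        size = support_size(g)
        total += size
        best = max(best, size)
    return Fraction(total, 2 ** n), best


n = 5
gens = [transposition(n, i, (i + 1) % n) for i in range(n)]
mean_support, max_support = subproduct_statistics(gens)
```

The mean is at least n/2 for any admissible choice of the $g_i$, so some subproduct must reach that size; the enumeration simply makes the averaging argument concrete.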



The proof of the following lemma may seem familiar; its basic idea is common in 
sphere-packing arguments. 
Lemma 4.4. For a positive integer n, let $k = \lfloor \log_2 n \rfloor = \lfloor (\log n)/(\log 2) \rfloor$ and $U = \{0,1\}^{5k}$ the set of 0,1-sequences of length 5k. If n is larger than an appropriate absolute constant then there exists $V \subseteq U$, $|V| > n$ such that any two sequences in V differ in more than k coordinates.

Proof. We may construct the required set $V = \{v_1, v_2, \dots\}$ by the following procedure. Let $v_1 \in U$ be arbitrary. If $v_1, \dots, v_m$ are already defined then consider the balls $B_k(v_i)$ of radius k around the $v_i$, i.e., $B_k(v_i)$ consists of those elements of U that differ from $v_i$ in at most k coordinates. If

$U_m := U \setminus \bigcup_{i=1}^{m} B_k(v_i) \neq \emptyset$

then we define $v_{m+1}$ as an arbitrary element of $U_m$; if $U_m = \emptyset$ then we stop the construction of V.

Using (3.7), we can estimate $|B_k(v_i)|$ as

= < + < + 1)^^ (S)' < 

where the last inequality holds for n larger than an absolute constant. Therefore, if $m < 2^{k+1}$ then $U_m \neq \emptyset$, proving that $|V| \geq 2^{k+1} > n$. □
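The greedy procedure in this proof can be run directly for small n. The Python sketch below is an illustration only: 0,1-sequences of length 5k are encoded as integers, and the "arbitrary" element of $U_m$ is taken to be the smallest remaining integer, which is one admissible choice among many; the function names are hypothetical. The lemma is stated for n beyond an absolute constant, so the small value used here is just an example where the conclusion happens to hold.

```python
def greedy_separated_set(length, radius):
    """Scan all 0,1-sequences of the given length (encoded as integers) and keep
    each one lying outside the radius-`radius` Hamming balls around those kept."""
    chosen = []
    for word in range(2 ** length):
        if all(bin(word ^ v).count("1") > radius for v in chosen):
            chosen.append(word)
    return chosen


def lemma_4_4_set(n):
    """k = floor(log_2 n), and a set V inside {0,1}^(5k) built greedily."""
    k = n.bit_length() - 1
    return k, greedy_separated_set(5 * k, k)


k_small, V_small = lemma_4_4_set(4)
```

The procedure stops only when the balls cover U, so $|V|$ times the size of a ball is at least $2^{5k}$; this is the counting step that the proof turns into $|V| \geq 2^{k+1}$.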

The following lemma is the main step toward answering the question raised at 
the beginning of the subsection. 

Lemma 4.5. Let $A \subseteq \mathrm{Sym}([n])$ with $A = A^{-1}$, $e \in A$ and $\langle A \rangle = \mathrm{Sym}([n])$ or $\mathrm{Alt}([n])$. Then there are $g \in A^n$, $h \in A^{\lfloor n^{44 \log n} \rfloor}$ such that the action of $\langle g, h \rangle$ on [n] has at most $472(\log n)^2$ orbits, provided that n is larger than an absolute constant.

Proof. Let $k = 5\lfloor (\log n)/(\log 2) \rfloor$. By Lemma 4.3, there is an element $g \in A^n$ with $|\mathrm{supp}(g)| = \alpha n \geq n/2$. Let $\varepsilon = 1/n$ and $\ell = \lceil 2n^{6k} \log(n^{2k}/\varepsilon) \rceil$. Let $h \in A^\ell$ be the outcome of a random walk of length $\ell$ as in Lemma 4.2.
Consider all words of the form

$f(\vec a) = h g^{a_1} h g^{a_2} \cdots h g^{a_k}$,

where $\vec a = (a_i : 1 \leq i \leq k)$ runs through all sequences in $U = \{0,1\}^k$. For $\beta \in [n]$, we wish to estimate from below the length of the orbit $\beta^{\langle g, h \rangle}$ by counting the number of different images $f_\beta(\vec a) := \beta^{f(\vec a)}$ for $\vec a \in U$.

To this end, for fixed elements $\vec a = (a_1, \dots, a_k)$ and $\vec a' = (a'_1, \dots, a'_k)$ in U and $\beta \in [n]$, we wish to bound from above the probability that $f_\beta(\vec a) = f_\beta(\vec a')$. We will do this by examining all possible trajectories $(\beta_1, \dots, \beta_k)$, $(\beta'_1, \dots, \beta'_k)$, where

$\beta_1 = \beta^{h g^{a_1}}, \ \beta_2 = \beta_1^{h g^{a_2}}, \ \dots, \ \beta_k = \beta_{k-1}^{h g^{a_k}}$ and $\beta'_1 = \beta^{h g^{a'_1}}, \ \dots, \ \beta'_k = (\beta'_{k-1})^{h g^{a'_k}}$,

counting how many satisfy $\beta_k \neq \beta'_k$ and then estimating the probability (for h chosen randomly in the manner described above) that such a pair of trajectories be traversed following $f(\vec a)$ and $f(\vec a')$.
Let $R = \{1 \leq i \leq k : a_i \neq a'_i\}$; let the elements of R be $k_1 < k_2 < \cdots < k_r$, where $r = |R|$. Let $r_0 \leq r$ be fixed. Let $k' = k_{r_0}$. Consider all tuples $(\beta_1, \beta_2, \dots, \beta_k, \beta'_{k'}, \dots, \beta'_k)$ of elements of [n] such that

(a) /3i, /32, . . . , Pk'-> ' ' ' •> l^'k distinct from each other and from /3, 

(b) /3f"\/3f /3f""\ (/3^,+i)»""''+\...,(/3^)f""'= are distinct from each 
other, 

(c) h-j i supp(g) for every j < tq, but 13k' G supp(5f), 

(d) W'^' =pC. 

The number of such tuples is at least 

(ro-l \ 2k-k' 

n (n - 1 supp(5)i - i) • (I supp(5)i - 1) • n - - 

where we count tuples by choosing first G [n] \ supp(5) for 1 < j < ro, then 
G supp(5), then the other Pi and /3j'. To justify the estimate on the number 
of choices at each stage, notice that at the j*^ choice with j < — 1 we have to 
make selections from [n] \ supp(g() so as to satisfy (jc]) while keeping them different 
from previous selections and from /? (to satisfy (jaj)). Then (3k' can be chosen as an 
arbitrary element of supp(5() different from j3. At this point, (jb|) is still satisfied 
automatically. At later choices, if /3j or 13'- is selected at stage j then enforcing (jaj) 
eliminates j possibilities and enforcing (jb]) eliminates j — 1, not necessarily different, 
possibilities. Note that (j4.ip also gives a valid lower estimate (namely, 0) in the case 
when To — 1 > n — I supp(g)|. 

By Lemma l42] (with 2k instead of A;, and with properties (jaj), (jb|) as inputs), the 
probability that a random h ^ satisfies 
(4.2) 

(/3, /3i, . . . , /3k-i,f3'k, , . . . , /3Li)^ = , , f3C' , iP'k'+if^'"^' , . . . , WkY'^'" ) 

is at least (1 - e) ("-(2^-^'))! > (i _ e)^^^. If /i satisfies (gj) then /S^s"' = /3i, 
/jj^^-^a = /32,. . . , = By properties (jcj) and (01), we also have p'^^"'' = /3i, 

f3l'^' = /32,. . . , fS^kC = /3f?""" = /3^,; by we also have (/3^,)'^^ ' = (3',,^„ 

a' 

■ ■ ■ 1 {P'k-i)^^ = P'k- Thus, in particular, any two distinct tuples 

(/3i,/32, . . . . . . ,/3fc) 

give us mutually exclusive events, even for different values of tq. Note also that, by 
property (jaj) and what we have just said, //?(«) = / = //3(a')- 
Hence the probability P that /^(a) ^ fpio,') is at least 

ro=l 



^ ^ ;;2irA- • [wu j • («n - 1) • n;:;:^) in - {2j - 1)) 
(4.3) > e;.i (1 - ^) (1 - f (« - ^) • n;ri^ (i - « - ^ 
If $\alpha n = |\mathrm{supp}(g)| \geq n - k$ then we estimate P from below by the summand $r_0 = 1$ in (4.3), yielding



P>n-iVi-^5V7a-lUi-i-^-^>i 



2k 

n J \ n J \ n J ' n n n ' n 

with the last inequality valid if n is larger than an absolute constant. 

If $\alpha n = |\mathrm{supp}(g)| < n - k$ then, estimating the terms $(1 - \alpha - j/n)$ in the last product in (4.3) from below by $(1 - \alpha - k/n)$, we obtain

^ ^ ' ^ n) n ) (" n) " n 

' ' iiy) V n/(l-(l-Q - ()c/n)) 

n / a + (fc/nj 

Since a > 1/2, we have "^j^ff > 1 - and (1 - a - (fc/n))^' < (1/2)'-, implying 

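$P \geq 1 - \frac{4k(2k+1)}{n} - \frac{2(k+1)}{n} - \frac{1}{2^r} \geq 1 - \frac{9k^2}{n} - \frac{1}{2^r}$,

if n is larger than an appropriate absolute constant.
We conclude that, for any two non-identical tuples

$\vec a = (a_1, \dots, a_k) \in \{0,1\}^k$, $\vec a' = (a'_1, \dots, a'_k) \in \{0,1\}^k$

and for any $\beta \in [n]$,

$\mathrm{Prob}\big(\beta^{h g^{a_1} h g^{a_2} \cdots h g^{a_k}} = \beta^{h g^{a'_1} h g^{a'_2} \cdots h g^{a'_k}}\big) \leq \frac{9k^2}{n} + \frac{1}{2^{r(\vec a, \vec a')}}$,

where $r(\vec a, \vec a')$ is the number of indices $1 \leq j \leq k$ for which $a_j \neq a'_j$.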

By Lemma 4.4, there exists a set V of more than n tuples so that any two tuples differ in more than k/5 coordinates. For fixed $\beta \in [n]$, writing $f_\beta(\vec a) = \beta^{h g^{a_1} h g^{a_2} \cdots h g^{a_k}} \in [n]$ for the random variable $\vec a \mapsto f_\beta(\vec a)$ defined using a random $h \in A^\ell$, we obtain that

nm,a')eV^:fp{a) = fp{a')}\)= ^ Prob(/;3(5) = //3(3')) 



< 1^1 + — + — T-7 I^KIV^I -1)<^+ — + TTfc7^ l^ 

n 2''(''''' ) / n \ n 2^1^ J 

|T/|2 

< (9A;2 + 3)^ < 472(logn)2^, 
n n 
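Concerning the length of the orbit $\beta^{\langle g, h \rangle}$ we have

$\mathbb{E}\left(\frac{1}{|\beta^{\langle g,h \rangle}|}\right) \leq \mathbb{E}\left(\frac{1}{|\{f_\beta(\vec a) : \vec a \in V\}|}\right) \leq \mathbb{E}\left(\frac{|\{(\vec a, \vec a') \in V^2 : f_\beta(\vec a) = f_\beta(\vec a')\}|}{|V|^2}\right) \leq \frac{472(\log n)^2}{n}$,

where we use the Cauchy-Schwarz inequality (or the inequality between the arithmetic and quadratic means) in the second step for the numbers $m_i$ that measure how many times a particular value $\gamma_i$ occurs among the $f_\beta(\vec a)$, for some $\vec a \in V$.

Now, $\sum_{\beta \in [n]} 1/|\beta^{\langle g,h \rangle}|$ is just the number of orbits of $\langle g, h \rangle$ (since each such orbit contributes $|\beta^{\langle g,h \rangle}| \cdot 1/|\beta^{\langle g,h \rangle}| = 1$ to the sum). Hence

$\mathbb{E}(\text{number of orbits of } \langle g, h \rangle) \leq 472(\log n)^2$.

In particular, there exists an $h \in A^\ell$ such that the number of orbits of $\langle g, h \rangle$ is at most $472(\log n)^2$. Since $\ell = \lceil 2n^{6k} \log(n^{2k}/\varepsilon) \rceil$ is smaller than $n^{44 \log n}$ for n larger than a constant, we are done. □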
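The counting identity used at the end of this proof, namely that the number of orbits equals the sum over all points $\beta$ of the reciprocal of the length of the orbit of $\beta$, can be illustrated on a toy example. The Python sketch below is an illustration only; the generators chosen and the function name are hypothetical.

```python
from fractions import Fraction


def orbits(n, generators):
    """Orbits on {0, ..., n-1} of the group generated by the given permutations
    (tuples p with p[x] the image of x)."""
    seen, result = set(), []
    for start in range(n):
        if start in seen:
            continue
        orbit, frontier = {start}, [start]
        while frontier:
            x = frontier.pop()
            for g in generators:
                y = g[x]
                if y not in orbit:
                    orbit.add(y)
                    frontier.append(y)
        seen |= orbit
        result.append(orbit)
    return result


# g = (0 1 2)(3 4) and h = (5 6) on eight points; the point 7 is fixed by both.
g = (1, 2, 0, 4, 3, 5, 6, 7)
h = (0, 1, 2, 3, 4, 6, 5, 7)
orbit_list = orbits(8, [g, h])
reciprocal_sum = sum(Fraction(1, len(orbit)) for orbit in orbit_list for _ in orbit)
```

Because the identity holds for every outcome h, taking expectations point by point bounds the expected number of orbits, which is how the estimate for a single $\beta$ is summed over [n].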

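Proposition 4.6. Let $A \subseteq \mathrm{Sym}([n])$ with $A = A^{-1}$, $e \in A$ and $\langle A \rangle = \mathrm{Sym}([n])$ or $\mathrm{Alt}([n])$. If n is larger than an absolute constant, then there are $g_1, g_2, g_3 \in A^{\lfloor n^{44 \log n} \rfloor}$ such that $\langle g_1, g_2, g_3 \rangle$ is transitive.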

Proof. Let g, h be as in Lemma 4.5. Let $\varepsilon = 1/n$, $\ell = \lceil 2n^6 \log(n^2/\varepsilon) \rceil$. Let $g' \in A^\ell$ be the outcome of a random walk of length $\ell$ as in Lemma 4.2. Note that $\ell \leq \lfloor n^{44 \log n} \rfloor$ for n larger than an absolute constant.

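Let $\Delta$ be the union of orbits of $\langle g, h \rangle$ of length less than $\sqrt{n}$. Since, by Lemma 4.5, there are at most $472(\log n)^2$ orbits of $\langle g, h \rangle$, we have $|\Delta| < 472\sqrt{n}(\log n)^2$. Let S be a set consisting of one element $\alpha$ of each orbit of length less than $\sqrt{n}$. Then, for each $\alpha \in S$, Lemma 4.2 implies that

$\mathrm{Prob}(\alpha^{g'} \in \Delta) \leq (1 + \varepsilon)\frac{|\Delta|}{n} \leq \left(1 + \frac{1}{n}\right) \frac{472(\log n)^2}{\sqrt{n}}$

and so

(4.5) $\mathrm{Prob}\big((\exists \alpha \in S)\,(\alpha^{g'} \in \Delta)\big) \leq \left(1 + \frac{1}{n}\right) \frac{472^2 (\log n)^4}{\sqrt{n}}$.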

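Let $\kappa$ be an orbit of $\langle g, h \rangle$ contained in $[n] \setminus \Delta$; by definition, $|\kappa| \geq \sqrt{n}$. Let $\kappa_0$ be the largest orbit; by the pigeonhole principle, $|\kappa_0| \geq \frac{n}{472(\log n)^2}$. Then

$\mathbb{E}(|\kappa^{g'} \cap \kappa_0|) = \sum_{\alpha \in \kappa} \mathrm{Prob}(\alpha^{g'} \in \kappa_0) \geq \sum_{\alpha \in \kappa} (1 - \varepsilon)\frac{|\kappa_0|}{n} = (1 - \varepsilon)\frac{|\kappa||\kappa_0|}{n}$,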
whereas

= ^ Prob(af' G Ko) + Yl PTOb((Q, /3)9' = (q', 



< 



^ n ^ ^ n(n - 1) 



Thus 



<(! + (.)( ' MN_ _^ - 1)\ko\{\ko\ - 1) 

~ ' n n{n — 1) 

<(l+.)('^ + ^^ 



Var(|K^ nKo|)=E(|K^' r^ kq^) - n/col) 

< (1 + .) f ^ + ^4^1 - (1 - ef\^^ 



\ n 



n \ n / n 

By Chebyshev's inequahty, 

Vard^s' n Kol) 



Prob(K^ n Ko = 0) < 



^ (|K||Ko|/n)(l + 4/n) _^ 12n 12-472(logn) 

(1 _g)2 l«PK)P - \k\\ko\ ^ 

Hence 

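(4.6) $\mathrm{Prob}\big((\exists \kappa \subseteq ([n] \setminus \Delta))\,(\kappa^{g'} \cap \kappa_0 = \emptyset)\big) \leq \frac{12 \cdot 472^2 (\log n)^4}{\sqrt{n}}$.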

Now, for n larger than a constant,

$\left(1 + \frac{1}{n}\right) \frac{472^2 (\log n)^4}{\sqrt{n}} + \frac{12 \cdot 472^2 (\log n)^4}{\sqrt{n}} < 1$.

Therefore, (4.5) and (4.6) imply that with positive probability, (a) $\kappa^{g'}$ intersects $[n] \setminus \Delta$ for every orbit $\kappa$ not contained in $[n] \setminus \Delta$ and (b) $\kappa^{g'}$ intersects $\kappa_0$ for every orbit $\kappa$ contained in $[n] \setminus \Delta$. In particular, this happens for some $g' \in A^\ell$. Properties (a) and (b) imply that $\langle g, h, g' \rangle$ is transitive. We set $g_1 = g$, $g_2 = h$, $g_3 = g'$ and are done. □
We will later use† the following corollary with k = 2.

Corollary 4.7. Let $A \subseteq \mathrm{Sym}([n])$ with $A = A^{-1}$, $e \in A$ and $\langle A \rangle = \mathrm{Sym}([n])$ or $\mathrm{Alt}([n])$. Let $k \geq 1$. If n is larger than a constant depending only on k, then there is a set $S \subseteq A^{\lfloor n^{45 \log n} \rfloor}$ of size at most 3k such that $\langle S \rangle$ is k-transitive.

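Proof. Let $\alpha_1 \in [n]$ be arbitrary. Since $\langle A \rangle$ is transitive, Lemma 3.9 implies that $\alpha_1^{A^n} = [n]$. Let $G = \mathrm{Sym}([n])$, $H = G_{\alpha_1}$, $A' = A^n$. Since $\alpha_1^{A'} = [n]$, $A'$ intersects every coset of H in G. By Schreier's Lemma (Lem. 3.8), it follows that $(A')^3 \cap H$ generates $\langle A \rangle \cap H$, which is either $\mathrm{Sym}([n] \setminus \{\alpha_1\})$ or $\mathrm{Alt}([n] \setminus \{\alpha_1\})$. Let $A_1 = (A')^3 \cap H$.

Iterating, we obtain a sequence of sets $A_0 = A, A_1, A_2, \dots, A_{k-1} \subseteq \mathrm{Sym}([n])$ and a sequence of elements $\alpha_1, \alpha_2, \dots \in [n]$ such that $A_i \subseteq A^{(3n)^i}$ and $\langle A_i \rangle$ is $\mathrm{Sym}([n] \setminus \{\alpha_1, \dots, \alpha_i\})$ or $\mathrm{Alt}([n] \setminus \{\alpha_1, \dots, \alpha_i\})$.

Let $(g_1)_i, (g_2)_i, (g_3)_i$ be as in Prop. 4.6 applied with $A_i$ instead of A. Then $(g_1)_i, (g_2)_i, (g_3)_i \in A^{(3n)^i \lfloor n^{44 \log n} \rfloor}$ and $\langle (g_1)_i, (g_2)_i, (g_3)_i \rangle \leq \mathrm{Sym}([n] \setminus \{\alpha_1, \dots, \alpha_i\})$ is transitive on $[n] \setminus \{\alpha_1, \dots, \alpha_i\}$ for $0 \leq i \leq k - 1$. Thus, for $S = \bigcup_{i=0}^{k-1} \{(g_1)_i, (g_2)_i, (g_3)_i\}$, $\langle S \rangle$ is k-transitive on [n]. □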

5. The splitting lemma and its consequences 

We will prove what is in effect an adaptation of Babai's splitting lemma (proven for groups in [Bab82, Lem. 3.1]) to the case of sets. This is a key point in this paper: the splitting lemma will allow us to construct long stabiliser chains with large orbits.

The following easy lemma will make an "unfolding" step possible. 

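Lemma 5.1. Let $A \subseteq \mathrm{Sym}([n])$, $\Sigma \subseteq [n]$ and $g \in \mathrm{Sym}([n])$. Then

$A_{(\Sigma^g)} = g^{-1}(gAg^{-1})_{(\Sigma)}\, g$.

Proof. We have $\mathrm{Sym}([n])_{(\Sigma^g)} = g^{-1}\,\mathrm{Sym}([n])_{(\Sigma)}\, g$. Therefore,

$A_{(\Sigma^g)} = A \cap \mathrm{Sym}([n])_{(\Sigma^g)} = A \cap g^{-1}\,\mathrm{Sym}([n])_{(\Sigma)}\, g$

$= g^{-1}(gAg^{-1} \cap \mathrm{Sym}([n])_{(\Sigma)})\, g = g^{-1}(gAg^{-1})_{(\Sigma)}\, g$.

□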
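The identity of Lemma 5.1 can be confirmed by brute force for small n. The Python sketch below is an illustration only: it assumes the right-action convention $\alpha^{gh} = (\alpha^g)^h$ on the points $0, \dots, n-1$, and the function names are hypothetical.

```python
from itertools import permutations


def compose(p, q):
    """Product pq under the right-action convention: apply p first, then q."""
    return tuple(q[p[i]] for i in range(len(p)))


def inverse(p):
    inv = [0] * len(p)
    for point, image in enumerate(p):
        inv[image] = point
    return tuple(inv)


def pointwise_stabiliser(elements, points):
    """Elements of the given set that fix every point of `points`."""
    return {p for p in elements if all(p[x] == x for x in points)}


def lemma_5_1_holds(A, sigma, g):
    """Compare A_(Sigma^g) with g^{-1} (g A g^{-1})_(Sigma) g."""
    g_inv = inverse(g)
    lhs = pointwise_stabiliser(A, {g[x] for x in sigma})
    conjugated = {compose(compose(g, a), g_inv) for a in A}
    rhs = {compose(compose(g_inv, x), g)
           for x in pointwise_stabiliser(conjugated, sigma)}
    return lhs == rhs


sym4 = list(permutations(range(4)))
A_example = sym4[::2]  # an arbitrary subset of Sym(4), not a subgroup
all_hold = all(lemma_5_1_holds(A_example, {0, 2}, g) for g in sym4)
```

The check uses a subset A that is not a group, which is the situation the "unfolding" step is designed for.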

Notice a feature of the following statement: there is a high power of A in the assumptions, not just in the conclusion. We will "unfold" the high power of A in the course of the proof. (By $\Sigma^S$ we mean the set $\Sigma^S = \{\alpha^g : \alpha \in \Sigma, g \in S\}$.)

Proposition 5.2 (Splitting Lemma). Let $A \subseteq \mathrm{Sym}([n])$ with $A = A^{-1}$, $e \in A$ and $\langle A \rangle$ 2-transitive. Let $\Sigma \subseteq [n]$. Assume that there are at least $\rho n(n-1)$ ordered pairs $(\alpha, \beta)$ of distinct elements of [n] such that there is no $g \in (A^{\lfloor 5n^6 \log n \rfloor})_{(\Sigma)}$ with $\alpha^g = \beta$. Then there is a subset S of $A^{\lfloor 3n^6 \log n \rfloor}$ with

$(AA^{-1})_{(\Sigma^S)} = \{e\}$

and

$|S| \leq \left\lceil \frac{2}{\log(3/(3 - 2\rho))} \cdot \log n \right\rceil$.

† (Footnote to the sentence introducing Corollary 4.7.) If we wished to, we could use it to obtain a set S of generators of $\mathrm{Alt}([n])$ or $\mathrm{Sym}([n])$ simply by setting k = 6: the Classification of Finite Simple Groups implies that a 6-transitive group must be either alternating or symmetric.

Proof. Set £ = \2n^ log(n2/(l /3))] ; note that £ < [3n^ log nj and 2£+2 < [5n^ log n\ 
for $n \geq 5$. (For $n < 5$, the statement is trivial.) By Lemma 4.2, applied with k = 2 and $\varepsilon = 1/3$, we obtain that given any two distinct elements $\alpha, \beta \in [n]$ and $g \in A^\ell$, the pair $(\alpha^g, \beta^g)$ adopts any possible value $(\alpha', \beta')$ with probability at least $(1 - 1/3)/(n(n-1))$, where we choose $g \in A^\ell$ with the distribution in Lemma 4.2 ($g = g_1 g_2 \cdots g_\ell$, $g_i$ chosen independently from $A' \cup \{e\}$, where $A'$ is a symmetric subset of A). Since this distribution is symmetric, this is the same as saying that $(\alpha^{g^{-1}}, \beta^{g^{-1}})$ adopts any possible value $(\alpha', \beta')$ with probability at least $(1 - 1/3)/(n(n-1))$.

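Now, given $(\alpha, \beta)$ and $g \in A^\ell$, we have $h \in (AA^{-1})_{(\Sigma^g)}$ and $\alpha^h = \beta$ if and only if $ghg^{-1} \in g(AA^{-1})_{(\Sigma^g)}g^{-1}$ and $(\alpha^{g^{-1}})^{ghg^{-1}} = \beta^{g^{-1}}$. By Lemma 5.1 applied to $AA^{-1}$, we have that $ghg^{-1} \in g(AA^{-1})_{(\Sigma^g)}g^{-1}$ only if $ghg^{-1} \in (gAA^{-1}g^{-1})_{(\Sigma)}$, which in turn can happen only if $ghg^{-1} \in (A^{2\ell+2})_{(\Sigma)}$. Thus, if there is no element $g' \in (A^{2\ell+2})_{(\Sigma)}$ with $(\alpha^{g^{-1}})^{g'} = \beta^{g^{-1}}$, then there is no element $h \in (AA^{-1})_{(\Sigma^g)}$ with $\alpha^h = \beta$. (This is the "unfolding" step we referred to before.)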

Since by hypothesis there are at least $\rho n(n-1)$ ordered pairs $(\alpha', \beta')$ such that there is no element $g' \in (A^{2\ell+2})_{(\Sigma)}$ with $\alpha'^{g'} = \beta'$, and since $(\alpha^{g^{-1}}, \beta^{g^{-1}})$ equals any such pair with probability at least $(2/3)/(n(n-1))$, we see that the probability that there is no element $h \in (AA^{-1})_{(\Sigma^g)}$ with $\alpha^h = \beta$ is at least $2\rho/3$.

Let S be a set of r random $g \in A^\ell$ (chosen independently, with the distribution as above). The probability that for every $g \in S$ there is an element $h \in (AA^{-1})_{(\Sigma^g)}$ with $\alpha^h = \beta$ is at most $(1 - 2\rho/3)^r$. This must happen if there is an element $h \in (AA^{-1})_{(\Sigma^S)}$ such that $\alpha^h = \beta$. Thus, the probability that there is such an h is at most $(1 - 2\rho/3)^r$, and the probability that there is such an h for at least one of the $n(n-1)$ pairs $(\alpha, \beta)$ is at most $n(n-1)(1 - 2\rho/3)^r$.

Setting $r = \lceil (\log n^2)/(\log 3/(3 - 2\rho)) \rceil$, we obtain that the probability that there is such an h for at least one pair is less than 1. Hence there is a set $S \subseteq A^\ell$ with at most r elements such that, for every pair $(\alpha, \beta)$ of distinct elements of [n], there is no $h \in (AA^{-1})_{(\Sigma^S)}$ with $\alpha^h = \beta$. This implies immediately that the only element of $(AA^{-1})_{(\Sigma^S)}$ is the identity. □
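The final union bound can be checked numerically. The short Python sketch below (an illustration only, with hypothetical function names) computes $r = \lceil (\log n^2)/\log(3/(3-2\rho)) \rceil$ and the resulting bound $n(n-1)(1 - 2\rho/3)^r$ on the failure probability.

```python
import math


def sample_count(n, rho):
    """r = ceil(log(n^2) / log(3 / (3 - 2 rho))), as chosen at the end of the proof."""
    return math.ceil(math.log(n ** 2) / math.log(3 / (3 - 2 * rho)))


def failure_bound(n, rho):
    """Union bound n(n-1)(1 - 2 rho / 3)^r over all ordered pairs (alpha, beta)."""
    r = sample_count(n, rho)
    return n * (n - 1) * (1 - 2 * rho / 3) ** r


example_r = sample_count(100, 0.05)
example_bound = failure_bound(100, 0.05)
```

Since $1 - 2\rho/3 = (3 - 2\rho)/3$, the choice of r gives $(1 - 2\rho/3)^r \leq n^{-2}$, so the bound is at most $(n-1)/n < 1$ for every n and every $\rho \in (0,1)$.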

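Corollary 5.3. Let $A \subseteq \mathrm{Sym}([n])$ with $A = A^{-1}$, $e \in A$ and $\langle A \rangle$ 2-transitive. Let $A' = A^{\lfloor 5n^6 \log n \rfloor}$. Let $\Sigma \subseteq [n]$ be such that

$|\alpha^{A'_{(\Sigma)}}| < (1 - \rho)n$

for every $\alpha \in [n]$, where $\rho \in (0,1)$. Then

$|\Sigma| \geq \frac{\log |A|}{\left\lceil \frac{2}{\log(3/(3-2\rho))} \cdot \log n \right\rceil \cdot \log n}$.

In particular, if $\rho = 0.05$ then $|\Sigma| \geq (\log |A|)/(60(\log n)^2)$.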
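Proof. Since $|\alpha^{A'_{(\Sigma)}}| < (1 - \rho)n$ for every $\alpha \in [n]$, there are at least $\rho n(n-1)$ tuples $(\alpha, \beta)$ such that there is no $g \in A'_{(\Sigma)}$ with $\alpha^g = \beta$. By Prop. 5.2, there is a set $S \subseteq \mathrm{Sym}([n])$ such that $(AA^{-1})_{(\Sigma^S)} = \{e\}$ and $|S| \leq \left\lceil \frac{2}{\log(3/(3-2\rho))} \cdot \log n \right\rceil$. Since $(AA^{-1})_{(\Sigma^S)} = \{e\}$, we know, by Lemma 3.17, that $|\Sigma^S| \geq \log_n |A|$. Clearly $|\Sigma^S| \leq |\Sigma||S|$. Hence

$|\Sigma| \geq \frac{\log_n |A|}{\left\lceil \frac{2}{\log(3/(3-2\rho))} \cdot \log n \right\rceil} = \frac{\log |A|}{\left\lceil \frac{2}{\log(3/(3-2\rho))} \cdot \log n \right\rceil \cdot \log n}$.

□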
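The constant 60 in the case $\rho = 0.05$ can be traced numerically. The Python sketch below is an illustration under an explicit assumption: it takes the bound on $|S|$ to be $\lceil 2 \log n / \log(3/(3-2\rho)) \rceil$, which is what the choice of r in the proof of Prop. 5.2 gives; the function names are hypothetical.

```python
import math


def splitting_constant(rho):
    """The factor 2 / log(3 / (3 - 2 rho)) multiplying log n in the bound on |S|."""
    return 2 / math.log(3 / (3 - 2 * rho))


def base_size_lower_bound(log_size_A, n, rho):
    """Lower bound on |Sigma| from |Sigma| |S| >= log_n |A| (assumed form of |S|)."""
    s_bound = math.ceil(splitting_constant(rho) * math.log(n))
    return log_size_A / (s_bound * math.log(n))


constant_005 = splitting_constant(0.05)  # a little under 59
```

Since the constant is slightly below 59, the ceiling $\lceil 58.99\ldots \cdot \log n \rceil$ stays below $60 \log n$ once $\log n \geq 1$, which accounts for the factor $60(\log n)^2$.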



A key idea in the proof of the Main Theorem is the following. For $A \subseteq \mathrm{Sym}([n])$, we can construct $A' = A^{\lfloor 5n^6 \log n \rfloor}$ and $\Sigma = \{\alpha_1, \alpha_2, \dots\} \subseteq [n]$ starting with an empty set and taking at each step $\alpha_i$ to be an element such that $|\alpha_i^{(A')_{(\alpha_1,\dots,\alpha_{i-1})}}| \geq (1 - \rho)n$ (say); if no such element exists, we stop the procedure. By Cor. 5.3, $|\Sigma|$ must be large.

An application of Lemma 3.19 will give that, for $A'' = (A')^{16m^6}$, the set $(A'')_\Sigma$ contains a copy of $\mathrm{Alt}(\Delta)$, where $\Delta \subseteq \Sigma$ and $|\Delta| \geq (1 - \rho)|\Sigma|$. Such a large alternating group certainly looks like a valuable tool.

6. Proof of the main theorem 

The core of this section is Proposition 6.5. It is a growth result, but not quite
of type \A- A- A\ > \A\^+^ or \A^\ > What will grow by a factor at each 

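step is not the number of elements $|A|$ of A, but rather the length m of a sequence $\alpha_1, \dots, \alpha_m$ such that the orbits

(6.1) $\alpha_1^{A}, \ \alpha_2^{A_{(\alpha_1)}}, \ \alpha_3^{A_{(\alpha_1,\alpha_2)}}, \ \dots, \ \alpha_m^{A_{(\alpha_1,\dots,\alpha_{m-1})}}$

are all large.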

This growth result (Prop. 6.5) will be applied iteratively. There are two ways for the iteration to stop: (a) an element we construct could fix a large set pointwise (we call this the case of exit), or (b) a group we work with could fail to have a large alternating composition factor. In case (a), we obtain all of $G = \mathrm{Alt}([n])$ in a few steps by Thm. 1.4. In case (b), we can descend to the problem of proving small diameter for $n'$ smaller than n by a constant factor. (Here, as in "infinite descent", the term "descent" means the same as induction, seen backwards.)

* * * 

Let us sketch briefly the proof of Prop. 6.5. First, we use (6.1) to construct many elements in the setwise stabiliser $G_\Sigma$, where $\Sigma = \{\alpha_1, \dots, \alpha_m\}$; in fact we get an entire copy of a large alternating group in $(G_\Sigma)|_\Sigma$ (Lemma 3.19). This is the setup. Then comes the creation step: we use the action by conjugation of $G_\Sigma$ on the pointwise stabiliser $G_{(\Sigma)}$ to construct many elements of $G_{(\Sigma)}$ (Lemma 6.1). We organise these new elements (all in a power $A'$ of A) as follows: we apply Cor. 5.3 (a consequence of the splitting lemma) to lengthen our stabiliser chain $A' \supseteq A'_{(\alpha_1)} \supseteq \cdots \supseteq A'_{(\alpha_1,\dots,\alpha_m)}$ by further stabilisers, in such a way that the orbits (defined as in (6.1)) are still large. We repeat the organiser step about $\gg (\log n)/(\log m)$ times. There are only two ways for this procedure to stop prematurely, namely, exit and descent (cases (a) and (b) discussed above).

* * * 

We start by proving the lemma containing the creation step: we give a way to construct many elements in a subgroup $H^-$ of a group G. The basic idea is the application of the orbit-stabiliser principle to the action by conjugation of a subgroup $H^+ \leq N_G(H^-)$ on $H^-$, where $N_G(H^-)$ is the normaliser of $H^-$.

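Lemma 6.1. Let $G = \mathrm{Sym}([n])$ or $\mathrm{Alt}([n])$, $H^- \leq G$, $H^+ \leq N_G(H^-)$, $\Gamma$ an orbit of . Let $Y = \{y_1, \dots, y_r\} \subseteq H^-$ be such that $\langle Y \rangle|_\Gamma$ is 2-transitive on $\Gamma$. Let $B \subseteq H^+$. Then either

(a) there is a $b \in BB^{-1} \setminus \{e\}$ fixing $\Gamma$ pointwise, or

(b) $|B^{-1}YB \cap H^-| \geq |B|^{1/r}$.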

Proof. Consider the action of $B$ on $\vec{y} = (y_1, \dots, y_r)$ by conjugation: for $b \in B$, we
define $\vec{y}^{\,b} := (y_1^b, \dots, y_r^b)$, where $y^b = b^{-1}yb$. Assume first that there are two distinct
elements $b_1, b_2 \in B$ such that $\vec{y}^{\,b_1}|_\Gamma = \vec{y}^{\,b_2}|_\Gamma$. Then $b_1b_2^{-1}|_\Gamma$ centralises $\langle Y\rangle|_\Gamma$, implying
that $b_1b_2^{-1}|_\Gamma \in Z(\langle Y\rangle|_\Gamma) = \{e\}$. Hence $b_1b_2^{-1} \in BB^{-1}$ fixes $\Gamma$ pointwise without being
the identity, i.e., conclusion (a) holds.

Assume now that the restrictions $\vec{y}^{\,b}|_\Gamma$ are all distinct. Hence, by the pigeonhole
principle, there exists an index $j \in \{1, \dots, r\}$ such that the set $W$ of conjugates
of $y_j$ by $B$ satisfies $|W|_\Gamma| \ge |B|^{1/r}$. Observe that all elements of $W$ are in $H^-$, as
$Y \subseteq H^-$ and $B \subseteq N(H^-)$. Hence $|B^{-1}YB \cap H^-| \ge |W| \ge |B|^{1/r}$. □
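
The dichotomy in this proof can be tried out on a toy case. The following is only a sketch under simplifying assumptions that are not in the text: we take $\Gamma$ to be the whole set $\{0,\dots,4\}$, $H^- = H^+ = \mathrm{Sym}(5)$, and $B$ all of $\mathrm{Sym}(5)$; the function names (`compose`, `conjugate`, `creation_dichotomy`) are hypothetical. With $\Gamma$ equal to the whole set, a collision between two conjugate tuples would mean that $b_1b_2^{-1}$ centralises $\langle Y\rangle$; otherwise the pigeonhole principle yields a coordinate with at least $|B|^{1/r}$ distinct conjugates.

```python
from itertools import permutations

def compose(p, q):
    """Apply p first, then q (permutations as tuples of images)."""
    return tuple(q[p[i]] for i in range(len(p)))

def inverse(p):
    inv = [0] * len(p)
    for point, image in enumerate(p):
        inv[image] = point
    return tuple(inv)

def conjugate(y, b):
    """y^b = b^{-1} y b, read left to right."""
    return compose(compose(inverse(b), y), b)

def creation_dichotomy(Y, B):
    """Return ('a', b1, b2) if two distinct b give the same tuple of conjugates,
    else ('b', W), where W is the largest set of conjugates of a single y_j."""
    seen = {}
    for b in B:
        key = tuple(conjugate(y, b) for y in Y)
        if key in seen:
            return ("a", seen[key], b)
        seen[key] = b
    W = max(({conjugate(y, b) for b in B} for y in Y), key=len)
    return ("b", W)

# Toy data (assumed for illustration): a transposition and a 5-cycle generate Sym(5).
B = list(permutations(range(5)))
Y = [(1, 0, 2, 3, 4), (1, 2, 3, 4, 0)]
result = creation_dichotomy(Y, B)
print(result[0], len(result[1]))
```

In this toy case all conjugate tuples are distinct, so the second branch is taken and the size of `W` can be compared with $|B|^{1/r}$ for $r = 2$.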

The following useful lemma is in part an easy application of Schreier's lemma 
and in part a consequence of a trick based on the following trivial fact: one clearly 
cannot have two disjoint copies within [n] of an orbit of size greater than n/2. 

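Lemma 6.2. Let $\Delta \subseteq [n]$. Let $B^+ \subseteq (\mathrm{Sym}(n))_\Delta$ with $B^+ = (B^+)^{-1}$, $e \in B^+$.
Assume $B^+|_\Delta$ is $\mathrm{Alt}(\Delta)$ or $\mathrm{Sym}(\Delta)$. Let $B^- = ((B^+)^3)_{(\Delta)}$.

Then $\langle B^-\rangle = \langle B^+\rangle_{(\Delta)} \lhd \langle B^+\rangle$. Furthermore, if $\langle B^-\rangle$ has an orbit $\Gamma$ of length
greater than $n/2$, then $\Gamma$ is also an orbit of $\langle B^+\rangle$.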

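Proof. Since $B^+|_\Delta$ is a group ($\mathrm{Alt}(\Delta)$ or $\mathrm{Sym}(\Delta)$), $B^+|_\Delta = \langle B^+\rangle|_\Delta$. Thus $B^+$
contains an element from every coset of $\langle B^+\rangle_{(\Delta)}$ in $\langle B^+\rangle$ and so, by Lemma 3.8,
$B^-$ contains a set of generators of $\langle B^+\rangle_{(\Delta)}$. Hence $\langle B^-\rangle = \langle B^+\rangle_{(\Delta)}$. In particular,
$\langle B^-\rangle \lhd \langle B^+\rangle$, as $\langle B^+\rangle_{(\Delta)}$ is the kernel of the action of $\langle B^+\rangle$ on $\Delta$.

The orbits of the normal subgroup $\langle B^-\rangle \lhd \langle B^+\rangle$ are blocks of imprimitivity for
$\langle B^+\rangle$. Since one cannot have two blocks of length greater than $n/2$, $\langle B^+\rangle$ leaves $\Gamma$
invariant as a set, and so $\Gamma$ is an orbit of $\langle B^+\rangle$. □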
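
The orbit statement can be checked by hand on a small example. The sketch below is illustrative only: the permutations `g1`, `g2`, the set $\Delta = \{5, 6\}$ and the helper names are assumptions made here, and `minus_gens` is merely a pair of elements fixing $\Delta$ pointwise, not a computed copy of $B^-$. Orbits are found with a union-find over the edges $x \to g(x)$.

```python
def compose(p, q):
    """Apply p first, then q (permutations as tuples of images)."""
    return tuple(q[p[i]] for i in range(len(p)))

def orbits(generators, n):
    """Orbits on {0, ..., n-1} of the group generated by `generators`."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for g in generators:
        for point, image in enumerate(g):
            parent[find(point)] = find(image)
    classes = {}
    for x in range(n):
        classes.setdefault(find(x), set()).add(x)
    return sorted((frozenset(c) for c in classes.values()), key=min)

n = 7
g1 = (1, 2, 3, 4, 0, 5, 6)  # 5-cycle on {0..4}; fixes Delta = {5, 6} pointwise
g2 = (1, 0, 2, 3, 4, 6, 5)  # swaps 0,1 and swaps 5,6 (an involution)
plus_gens = [g1, g2]                             # stabilise Delta setwise
minus_gens = [g1, compose(compose(g2, g1), g2)]  # fix Delta pointwise
gamma = frozenset(range(5))                      # candidate orbit, length 5 > n/2

print(orbits(minus_gens, n))
print(orbits(plus_gens, n))
```

Here the long orbit of the smaller group has length $5 > 7/2$, and it reappears unchanged among the orbits of the larger group, as the lemma predicts.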

Our descent step rests in part on a simple calculation. 

Lemma 6.3. Let $f(x) = e^{(\log x)^k g(x)}$, where $k \in \mathbb{Z}^+$, $t > 0$, and $g : [t,\infty) \to [0,\infty)$
is an increasing function. Let $0 < \delta_0 < 1$ and $\delta_1, \delta_2 > 0$ be given with $\delta_1 + \delta_2 <
k\log(1/\delta_0)$. Let $h : [t,\infty) \to [0,\infty)$ be a function with $h(x) \le e^{\delta_2(\log x)^{k-1} g(x)}$. Then

(6.2) $f(\delta_0 x) \cdot h(x) \le f(x)^{1 - \delta_1/\log x}$

provided $x$ is larger than a bound depending only on $k$, $t$, $\delta_0$, $\delta_1$, and $\delta_2$.

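Proof. Since the derivative of the function $j(z) := (1 - z\log(1/\delta_0))^k$ at $z = 0$ is less
than $-(\delta_1 + \delta_2)$, there exists an interval $(0, \epsilon)$, with $\epsilon$ depending on $k$, $\delta_0$, $\delta_1$, and
$\delta_2$, such that $j(z) \le 1 - (\delta_1 + \delta_2)z$ holds for all $z \in (0, \epsilon)$. Hence, if $x$ is large enough
that $1/\log x < \epsilon$ then

$(\log(\delta_0 x))^k = (\log x)^k \cdot j\left(\frac{1}{\log x}\right) \le (\log x)^k \left(1 - \frac{\delta_1 + \delta_2}{\log x}\right)$.

If in addition $\delta_0 x \ge t$ so that $f(\delta_0 x)$, $f(x)$, $g(x)$, and $h(x)$ are defined then

$f(\delta_0 x) = e^{(\log \delta_0 x)^k g(\delta_0 x)} \le e^{(\log \delta_0 x)^k g(x)} \le e^{((\log x)^k - (\delta_1+\delta_2)(\log x)^{k-1}) g(x)}$
$\le (h(x))^{-1} e^{-\delta_1(\log x)^{k-1} g(x)} \cdot f(x) = (h(x))^{-1} f(x)^{1 - \delta_1/\log x}$.

□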
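
As a sanity check, inequality (6.2) can be evaluated numerically on a logarithmic scale. The parameters below ($k = 4$, $\delta_0 = 0.5$, $\delta_1 = \delta_2 = 1$, $g(x) = \log\log x$) are illustrative assumptions chosen here, not the values used later in the paper, and $h$ is taken to be the largest function the hypothesis allows. The function name `lemma_6_3_margin` is hypothetical.

```python
import math

def lemma_6_3_margin(L, k=4, delta0=0.5, delta1=1.0, delta2=1.0):
    """Return log(RHS) - log(LHS) of (6.2) at x = e^L, with g(x) = log log x
    and the extremal choice h(x) = exp(delta2 * (log x)^(k-1) * g(x))."""
    assert delta1 + delta2 < k * math.log(1 / delta0)  # hypothesis of the lemma
    g = math.log                       # g expressed as a function of log x
    L_shifted = L + math.log(delta0)   # log(delta0 * x)
    log_lhs = L_shifted ** k * g(L_shifted) + delta2 * L ** (k - 1) * g(L)
    log_rhs = (1 - delta1 / L) * L ** k * g(L)
    return log_rhs - log_lhs

margins = [lemma_6_3_margin(L) for L in (50.0, 100.0, 1000.0)]
print(margins)
```

A positive margin means (6.2) holds at that value of $x$; working with $\log x$ directly avoids overflow for the very large $x$ at which the lemma is meant to be applied.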

The following lemma is also crucial to the descent step. In the proof of the lemma,
we use Lemma 3.20 to guarantee the existence of an element that we then construct
by other means.

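Lemma 6.4. Let $G = \mathrm{Sym}([n])$ or $\mathrm{Alt}([n])$. Let $\Delta \subseteq [n]$, $|\Delta| \ge (\log n)^{[\ldots]}$. Let $A \subseteq G$
with $A = A^{-1}$, $e \in A$ and $\langle A\rangle = G$. Let $B^+ \subseteq (A^\ell)_\Delta$, $\ell \ge 1$, with $B^+ = (B^+)^{-1}$,
$e \in B^+$. Assume $B^+|_\Delta$ is $\mathrm{Alt}(\Delta)$ or $\mathrm{Sym}(\Delta)$. Let $B^- = ((B^+)^3)_{(\Delta)}$. Assume $\langle B^-\rangle$
has an orbit $\Gamma$ of length at least $\rho n$, for some $\rho > 8/9$.

If all alternating composition factors $\mathrm{Alt}(k)$ of $\langle B^-\rangle$ satisfy $k \le \delta n$, where $\delta > 0$,
and

(6.3) $\max_{k \le \delta n} \mathrm{diam}(\mathrm{Alt}(k)) \le D_\delta$,

for some $D_\delta > 0$, and $n$ is larger than an absolute constant, then

$A^{\lfloor \ell e^{c(\log n)^3} D_\delta \rfloor} \supseteq \mathrm{Alt}([n])$,

where $c = c(\rho)$ depends only on $\rho$.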

Proof. The group $U := \langle B^-\rangle|_\Gamma$ is transitive. It is also isomorphic to a factor group
of $\langle B^-\rangle$, so $U$ also has no alternating factors $\mathrm{Alt}(k)$ with $k > \delta n$. By Thm. 1.1 and
by (6.3), there exists an absolute constant $C_1$ such that for

(6.4) $u := \lfloor e^{C_1(\log n)^3} \cdot D_\delta \rfloor$, $(B^-)^u|_\Gamma = U$.

Let $H = \langle B^+\rangle$. By Lemma 6.2, $\Gamma$ is an orbit of $H$. If $n$ is large enough that
Lemma 3.20 applies then there exists $g \in H$ of support less than $|\Gamma|/4$ on $\Gamma$. Take
$h \in B^+$ with $h|_\Delta = g|_\Delta$. Then $gh^{-1} \in \langle B^+\rangle_{(\Delta)} = \langle B^-\rangle$ and so, by (6.4), there
exists $b \in (B^-)^u$ with $gh^{-1}|_\Gamma = b|_\Gamma$. Therefore, $bh \in (B^+)^{3u+1}$ satisfies $bh|_\Gamma =
g|_\Gamma$. Since $g$ fixes at least $(3/4)|\Gamma| \ge (3/4) \cdot \rho n > (2/3)n$ points in $\Gamma$, we have
$|\mathrm{supp}(bh)| \le (1 - (3/4)\rho)n < n/3$. By Thm. 1.4, $(A \cup \{bh, (bh)^{-1}\})^{Kn^8}$ contains
$\mathrm{Alt}([n])$, where $K = K(\epsilon)$ ($\epsilon = 1 - (3/4)\rho < 1/3$) is the number defined in Thm. 1.4.
Since $A \cup \{bh, (bh)^{-1}\} \subseteq A^{(3u+1)\ell}$, we are done. □

We come to the key result in this section. 

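Proposition 6.5. Let $G = \mathrm{Sym}([n])$ or $\mathrm{Alt}([n])$. Let $A \subseteq G$ with $A = A^{-1}$, $e \in A$,
and $\langle A\rangle = G$. Let $\alpha_1, \alpha_2, \dots, \alpha_{m+1} \in [n]$ be such that

(6.5) $\left|\alpha_i^{A_{(\alpha_1,\dots,\alpha_{i-1})}}\right| \ge \frac{9}{10} n$

for every $i = 1, 2, \dots, m + 1$.

There exist absolute constants $C > 1$, $c_1, c_2, c_3, c_4 > 0$ with

(6.6) $c_4/c_3 < c_2 < 4(\log 1/0.95)c_1$

such that the following holds: if $m \ge (\log n)^2$, then either

(6.7) $A^{\lfloor C e^{c_1(\log n)^4 \log\log n - c_2(\log n)^3 \log\log n} \rfloor} \supseteq \mathrm{Alt}([n])$

or there are $\alpha_{m+2}, \alpha_{m+3}, \dots, \alpha_{m+\ell+1} \in [n]$, $\ell \ge c_3(m \log m)/(\log n)$, such that

(6.8) $\left|\alpha_i^{A'_{(\alpha_1,\dots,\alpha_{i-1})}}\right| \ge \frac{9}{10} n$

for $A' = A^{\lfloor n^{c_4 \log n} \rfloor}$ and every $i = 1, 2, \dots, m + \ell + 1$.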



The condition $c_4/c_3 < c_2 < 4(\log 1/0.95)c_1$ is needed for the recursion to work.
As we will see, this inequality is easy to attain; we are not taking advantage of any
numerical coincidence. We may choose $c_1 = 50182$, $c_2 = 10296$, $c_3 = 0.0745$, and
$c_4 = 767$.
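
The stated constants can be checked directly against condition (6.6). The short script below is just an arithmetic check; it also tests the hypothetical reading that $c_2$ and $c_1$ are the smallest integers compatible with $c_3$, $c_4$ and with (6.6), which is an assumption about how the values were chosen.

```python
import math

c1, c2, c3, c4 = 50182, 10296, 0.0745, 767

lower = c4 / c3                        # left-hand side of (6.6)
upper = 4 * math.log(1 / 0.95) * c1    # right-hand side of (6.6)
condition_holds = lower < c2 < upper

# Assumed reading: c2 and c1 are the least integers making (6.6) true.
smallest_c2 = math.ceil(c4 / c3)
smallest_c1 = math.ceil(c2 / (4 * math.log(1 / 0.95)))

print(lower, c2, upper, condition_holds)
```

Both inequalities in (6.6) are strict but tight, so the check is sensitive to the exact values of the four constants.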

An easy application of Proposition 6.5 proves Corollary 6.6 (which is equivalent
to our Main Theorem). Conversely, in order to prove Proposition 6.5 we will use
Corollary 6.6 for smaller values of $n$ in an inductive process. In the proofs of Prop. 6.5
and Cor. 6.6 we assume that $n$ is greater than a well-defined (but not explicitly
computed) absolute constant $n_0$; we need that $n$ is greater than an appropriate
constant so $n$ satisfies some inequalities we introduce during the proofs, and also
so that results from the previous sections, valid only for sufficiently large $n$, can be
applied. Once $n_0$ is defined, the constant $C$ in the statement of Prop. 6.5 can be
adjusted so that (6.7) holds trivially for $n \le n_0$.

Corollary 6.6. Let $G = \mathrm{Sym}([n])$ or $\mathrm{Alt}([n])$. Let $Y \subseteq G$ with $Y = Y^{-1}$, $e \in Y$
and $G = \langle Y\rangle$. Then $\mathrm{diam}(\Gamma(G, Y)) \le C e^{c_1(\log n)^4 \log\log n}$, where $C$ and $c_1$ are as in
Proposition 6.5.



The proof consists just of a repeated use of Proposition 6.5 plus some accounting.

Proof. We can assume that $n$ is large enough that $m_0 < 0.1n < n - 3$ for $m_0 =
\lfloor(\log n)^2\rfloor + 1$, and so $G$ acts transitively on the set $X$ of all $(m_0 + 1)$-tuples. Hence,
by Lemma 3.9, the set $A_0 := Y^{n^{m_0+1}} \supseteq Y^{|X|}$ acts transitively on the set of all
$(m_0 + 1)$-tuples. Thus (6.5) holds with $A_0$ instead of $A$, $m_0$ instead of $m$ and $\alpha_i = i$
for $i = 1, 2, \dots, m_0 + 1$. We apply Proposition 6.5 with these parameters. We obtain
either (6.7) or (6.8).

In the latter case, we set $\ell_0 = \ell$, $m_1 = m_0 + \ell_0$, and iterate: we apply
Proposition 6.5 to

$A_1 = A_0^r$, $A_2 = A_1^r = A_0^{r^2}$, $A_3 = A_2^r = A_0^{r^3}$, ...

where $r = \lfloor n^{c_4 \log n} \rfloor$. (After each step, we "save" the output $\ell$ to $\ell_i$ and set
$m_{i+1} = m_i + \ell_i$.) We stop when we obtain (6.7): say this happens when we apply
Proposition 6.5 with $A = A_k = A_0^{r^k}$. Clearly $m_k < 0.1n$ (as otherwise (6.5) could
not hold for $i = m_k + 1$).

It remains to estimate $k$. For $1 \le j \le \log n$, let $t_j$ be the largest index $i$ between
0 and $k$ such that $m_i \le e^j$; if no such index exists, set $t_j = 1$. We have $m_0 \ge 3$ and
so $t_1 = 1$. By Proposition 6.5, $m_{i+1} \ge (1 + (c_3 \log m_i)/(\log n)) \cdot m_i$ for all $i < k$, and
so, by $(1 + c_3 j/(\log n))^{\lfloor(\log n)/(c_3 j)\rfloor + 2} > e$, we have $t_{j+1} \le t_j + \lfloor(\log n)/(c_3 j)\rfloor + 2$.
Thus

$t_{\lfloor\log n\rfloor} \le t_1 + \sum_{j=1}^{\lfloor\log n\rfloor - 1} (t_{j+1} - t_j) \le 1 + \sum_{1 \le j < \log n} \left(\left\lfloor\frac{\log n}{c_3 j}\right\rfloor + 2\right) \le c_5 \log n \log\log n$

for any $c_5 > 1/c_3$, with the last inequality valid if $n$ is larger than an appropriate
constant. Set $c_5 = (1/c_3 + c_2/c_4)/2$ (say); note that (6.6) ensures that $c_2/c_4 > 1/c_3$,
and so $c_2/c_4 > c_5 > 1/c_3$. Since $t_{\lfloor\log n\rfloor} + 1 \ge k$ (because $m_k < 0.1n$), we get that
$k \le c_5 \log n \log\log n$.
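
To get a feeling for the size of $k$, one can iterate the minimal growth $m_{i+1} = (1 + c_3 \log m_i/\log n)\, m_i$ guaranteed by Proposition 6.5, starting from $m_0 = \lfloor(\log n)^2\rfloor + 1$, until $m$ reaches $0.1n$. This is only a rough simulation under assumptions made here: it treats $m$ as a real number, uses the worst-case growth at every step, and is run for small $n$, whereas the bound in the text is claimed only for $n$ larger than an unspecified constant. The function name is hypothetical.

```python
import math

def steps_until_saturation(n, c3=0.0745):
    """Count iterations of m -> (1 + c3*log(m)/log(n)) * m from
    m_0 = floor((log n)^2) + 1 until m >= 0.1 * n."""
    log_n = math.log(n)
    m = math.floor(log_n ** 2) + 1
    steps = 0
    while m < 0.1 * n:
        m *= 1 + c3 * math.log(m) / log_n
        steps += 1
    return steps

c2, c3, c4 = 10296, 0.0745, 767
c5 = (1 / c3 + c2 / c4) / 2

for n in (10 ** 6, 10 ** 9):
    bound = c5 * math.log(n) * math.log(math.log(n))
    print(n, steps_until_saturation(n), bound)
```

The simulated step counts can then be compared with $c_5 \log n \log\log n$ for the same $n$.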
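Thus

$A_k = A_0^{r^k} \subseteq Y^{n^{\lfloor(\log n)^2\rfloor + 2} \cdot \lfloor n^{c_4 \log n}\rfloor^{\lfloor c_5 \log n \log\log n\rfloor}} \subseteq Y^{\lfloor e^{c_6(\log n)^3 \log\log n}\rfloor}$

for any $c_6 > c_4 \cdot c_5$, if $n$ is large enough in terms of $c_6$. Set $c_6 = (c_4 \cdot c_5 + c_2)/2$ (say),
so that $c_6 < c_2$. Then, by (6.7) (valid for $A = A_k$), we obtain

$\mathrm{Alt}([n]) \subseteq \left(Y^{\lfloor e^{c_6(\log n)^3 \log\log n}\rfloor}\right)^{\lfloor C e^{c_1(\log n)^4 \log\log n - c_2(\log n)^3 \log\log n}\rfloor} \subseteq Y^{\lfloor C e^{c_1(\log n)^4 \log\log n}\rfloor - 1}$

for $n$ larger than a constant. If $Y \subseteq \mathrm{Alt}([n])$, then $Y^{\lfloor C e^{c_1(\log n)^4 \log\log n}\rfloor} = \mathrm{Alt}([n])$.
If $Y$ contains an odd permutation then $Y^{\lfloor C e^{c_1(\log n)^4 \log\log n}\rfloor} = \mathrm{Sym}([n])$. □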

We finally turn to the proof of Proposition 6.5. In what follows, let

(6.9) $f(x) = e^{c_1(\log x)^4 \log\log x}$, $g(x) = c_1 \log\log x$.

Proof of Proposition 6.5. We prove Proposition 6.5 by induction on $n$. During the
proof, in the inductive hypothesis, we will make use of Cor. 6.6 for values smaller
than $n$.

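We can assume that $n$ is large enough so that $m \ge (\log n)^2 > C(0.9)$, where $C(0.9)$
is as in Lemma 3.19. Apply Lemma 3.19 with $d = 0.9$ and $\Sigma = \{\alpha_1, \dots, \alpha_m\}$. We
obtain a set $\Delta \subseteq \Sigma$ such that $|\Delta| \ge 0.9|\Sigma|$ and $\left(\left((A^{16m^6})_\Sigma\right)_{(\Sigma \setminus \Delta)}\right)\big|_\Delta$ contains $\mathrm{Alt}(\Delta)$.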






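Let

$B^+ = \left\{ g \in \left((A^{16m^6})_\Sigma\right)_{(\Sigma \setminus \Delta)} : g|_\Delta \in \mathrm{Alt}(\Delta) \right\}$, $B^- = ((B^+)^3)_{(\Delta)}$.

This is our initial setup: we have a large set $B^+$ in the setwise stabiliser $G_\Sigma$;
furthermore, we have constructed a large subset $\Delta \subseteq \Sigma$ such that $B^+ \subseteq (G_\Sigma)_{(\Sigma \setminus \Delta)}$ and
$B^+|_\Delta = \mathrm{Alt}(\Delta)$. We also have a set $B^-$ in the pointwise stabiliser $G_{(\Sigma)}$. By (6.5)
with $i = m + 1$, $\left|\alpha_{m+1}^{A_{(\alpha_1,\dots,\alpha_m)}}\right| \ge \frac{9}{10}n$, and so $\langle B^-\rangle$ has an orbit $\Gamma$ of length at least $0.9n$.
By Lemma 6.2, $\Gamma$ is also an orbit of $\langle B^+\rangle$.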

We would like $\langle B^-\rangle$ to act as an alternating or symmetric group on $\Gamma$; let us show
that, if this is not the case, we obtain descent. We may assume that Corollary 6.6
holds for $n' < n$ (inductive hypothesis). Hence, if $\langle B^-\rangle$ has no composition factor
$\mathrm{Alt}(k)$ with $k > 0.95n$, then Lemma 6.4 (descent) gives us

$A^{\lfloor 16m^6 e^{c(\log n)^3} \cdot C f(0.95n) \rfloor} \supseteq \mathrm{Alt}([n])$,

for $n$ larger than an absolute constant, where $c = c(0.9)$ is from Lemma 6.4 and
$f(x)$ is as in (6.9). We apply Lemma 6.3 with $k = 4$, $h(x) = 16x^6 e^{c(\log x)^3}$, $\delta_0 = 0.95$,
$\delta_1 = c_2/c_1$, $\delta_2 \in (0, 4\log(1/\delta_0) - \delta_1)$ arbitrary, and $t$ large enough that $h(x) \le
\exp(\delta_2 c_1 (\log x)^3 \log\log x)$ holds for $x \ge t$. We obtain that $16m^6 e^{c(\log n)^3} \cdot C f(0.95n) \le
C f(n)^{1 - (c_2/c_1)/(\log n)}$ for $n$ larger than a constant, and so (6.7) holds and we are done.

Thus, we can suppose from now on that $\langle B^-\rangle$ does have a composition factor
$\mathrm{Alt}(k)$ for some $k > 0.95n$. The only orbit of $\langle B^-\rangle$ that can be of length at least $k$
is $\Gamma$, so $\langle B^-\rangle|_\Gamma = \langle B^-|_\Gamma\rangle$ must contain $\mathrm{Alt}(k)$ as a section. Hence, by Lemma 3.11,
$\langle B^-|_\Gamma\rangle \ge \mathrm{Alt}(\Gamma)$. (We can assume $0.95n > 84$, and thus Lemma 3.11 does apply.)
Note we also get that $|\Gamma| > 0.95n$.

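Now that we know that $\langle B^-|_\Gamma\rangle \ge \mathrm{Alt}(\Gamma)$, Corollary 4.7 gives us a small set of
elements $Y = \{y_1, y_2, \dots, y_6\} \subseteq (B^-)^{\lfloor n^{[\ldots]\log n}\rfloor}$ such that $\langle Y\rangle|_\Gamma$ is 2-transitive on $\Gamma$.
We apply Lemma 6.1 (creation) with $H^+ = \langle B^+\rangle$, $H^- = \langle B^-\rangle$, $B = B^+$ and $r = 6$.
(The condition $H^+ \le N_G(H^-)$ is fulfilled thanks to Lemma 6.2.)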

If conclusion (a) in Lemma 6.1 holds, then there is a $b \in B^+(B^+)^{-1} \setminus \{e\}$ with
$|\mathrm{supp}(b)| < 0.05n$. Thm. 1.4 thus gives us that $(A \cup \{b\})^{Kn^8} \supseteq \mathrm{Alt}([n])$, where
$K = K(0.05)$ is an absolute constant. Assuming that $n$ is large enough to satisfy

$2 \cdot 48m^6 \cdot Kn^8 \le 96Kn^{14} \le \lfloor\exp(c_1(\log n)^4 \log\log n - c_2(\log n)^3 \log\log n)\rfloor$,

we obtain that (6.7) holds and we are done. (This is what we call an exit from the
procedure.)

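We can thus assume that conclusion (b) in Lemma 6.1 holds, i.e., we have created
a set $W = (B^+)^{-1}YB^+ \cap \langle B^-\rangle$ with $|W| \ge |B^+|^{1/6}$. Note that $(B^+)^{-1}YB^+ \subseteq
A^{\lfloor n^{46\log n}\rfloor}$ (for $n$ larger than a constant) and $|B^+| \ge |\mathrm{Alt}(\Delta)| = (1/2)|\Delta|! > m^{0.894m}$
(for $m$ larger than a constant; recall that $|\Delta| \ge 0.9m$). Hence

(6.10) $\left|A^{\lfloor n^{46\log n}\rfloor} \cap \langle B^-\rangle\right| \ge |W| > m^{0.149m}$.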

Now that we have created many elements in the pointwise stabiliser of $\Sigma$, it is our
task to organise them: we wish to produce $\alpha_{m+2}, \dots, \alpha_{m+\ell+1}$ satisfying (6.8).
For $i \ge 0$, we define recursively $A_i, B_i \subseteq \langle A\rangle$ and a sequence $\Sigma_i$ of points in $[n]$.
Let ^0 = """K mo = m, Sq = (ai, . . . ,0^0+1), and Bq = (Ao)(Eo\{a„(,+i})- 

If Ai,Tii,Bi are already defined then let A[j^i = iog"J ^j^j be a 

largest possible extension Sj+i = (ai, . . . , Omi+i+i) of Sj = (ai, . . . , arm+i) such 
that 



> 0.9n, 



(6.11) 

for all J = 1, 2, ... , mj_|_i + 1. Finally, let 

^i+i = i^i+iT^"^ and = 



Note that for all i > 0, (Sj) has an orbit Fj of length at least 0.9n because 
0.9n. 

We stop the recursion, and set $w := i$ for the last $i$ for which $A_i$ is defined, if
either

(a) $|B_i|_{\Gamma_i}| < |B_i|$, i.e., there are two elements $b_1, b_2 \in B_i$ such that $b_1b_2^{-1}$ fixes $\Gamma_i$
pointwise; or
(b) $|\Gamma_i| < 0.95n$ or $\langle B_i|_{\Gamma_i}\rangle \not\ge \mathrm{Alt}(\Gamma_i)$; or
(c) $n^{m_i - m_0} \ge \sqrt{m^{0.149m}}$.

By (6.10), we have $|B_0| \ge m^{0.149m}$.

First, we estimate the differences mj+i — mi. If the recursion did not stop after 
the definition of Ai,Bi, and Sj then, in particular, the stopping criterion (c) is not 
fulfilled at step i. Lemma [3^ applied with {Bq) as G, G(Sj\So) as iJ, and Bq as ^4, 
then implies that 

\Bi\ > \Bi nH\> > \/m0-i49"^. 

Also, by the criteria (a) and (b), we have $|B_i|_{\Gamma_i}| = |B_i|$ and $\langle B_i\rangle|_{\Gamma_i}$ acts as $\mathrm{Alt}(\Gamma_i)$
or $\mathrm{Sym}(\Gamma_i)$ on $\Gamma_i$, where $|\Gamma_i| \ge 0.95n$.

Since $0.9n < 0.95 \cdot 0.95n \le 0.95|\Gamma_i|$, we can apply Corollary 5.3 with $\rho = 0.05$,
$B_i|_{\Gamma_i}$ instead of $A$, and $\Gamma_i$ instead of $[n]$, and obtain that, for $1 \le i \le w$,

(6.12) $m_i - m_{i-1} \ge \frac{\log|B_{i-1}|}{60(\log n)^2} > \frac{c_3 m \log m}{60(\log n)^2}$,

where we define $c_3 := 0.149/2 = 0.0745$. (This is what we have called an organiser
step. It is ultimately based on the splitting lemma (Prop. 5.2), of which Cor. 5.3 is
a corollary.)

At the same time, $n^{m_{w-1} - m_0} < \sqrt{m^{0.149m}}$ implies

$m_{w-1} - m_0 < \frac{c_3 \log m}{\log n} m$.

Since $m_{w-1} - m_0 = \sum_{i=1}^{w-1}(m_i - m_{i-1})$, from (6.12) it follows that

$\frac{c_3 \log m}{\log n} m > (w - 1)\frac{c_3 m \log m}{60(\log n)^2}$,

and we conclude that $w - 1 < 60 \log n$. Hence

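$A_w^{48n^6} = A^{\lfloor n^{46\log n}\rfloor \cdot \lfloor 5n^6 \log n\rfloor^w \cdot (48n^6)} \subseteq A^{\lfloor n^{46\log n}\rfloor \cdot \lfloor 240n^{12}\log n\rfloor^{60\log n}} \subseteq A^{\lfloor n^{c_4 \log n}\rfloor}$

for $c_4 := 767 > 46 + 12 \cdot 60$, provided that $n$ is larger than an absolute constant.
If $n^{m_w - m_0} \ge \sqrt{m^{0.149m}}$ (stopping condition (c)), then

$m_w - m_0 \ge \frac{c_3 \log m}{\log n} m$,

and so, setting $\ell = m_w - m_0$, we obtain (6.8). (In other words, our repeated
organising has succeeded.)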

If we stopped because condition (a) holds then $B_wB_w^{-1}$ contains a non-trivial element
$b_1b_2^{-1}$ with support less than $0.1n$. By Theorem 1.4, $(A \cup \{b_1b_2^{-1}\})^{Kn^8} \supseteq \mathrm{Alt}([n])$,
where $K = K(0.1)$ is an absolute constant. Assuming that $n$ is large enough to
satisfy

$2 \cdot \lfloor n^{c_4 \log n}\rfloor \cdot Kn^8 \le e^{c_1(\log n)^4 \log\log n - c_2(\log n)^3 \log\log n}$,

we obtain (6.7). (This is an exit case.)

Finally, suppose we stopped in case (6), i.e., {Bwlvni) 7^ Alt(r^) or |r^| < 0.95n. 
As > m > C(0.9), we can apply Lemma [3.191 with T,w as S and A'^ as A, to 
obtain C S^, |A^| > 0.9|S^| and B+ C {A'J^^"^" such that B+ fixes \ 
pointwise, fixes A^ setwise, and B^\^^ = Alt(A^). (This is a fresh setup.) Also, 
by LemmaO fi" = ((^^)^)(a„) generates (5+)(a,„) < (^i)- Note that fi" C 5^ 

and {B^) has an orbit of length at least 0.9n, because B:^ 5 '^m^+i- 

We are ready for another descent. The group $\langle B_w^-\rangle$ has no composition factor
$\mathrm{Alt}(k)$ with $k > 0.95n$, because such a factor would be a section of $\langle B_w\rangle$ and
Lemma 3.11 would imply that $\langle B_w|_{\Gamma_w}\rangle$ is an alternating group on $> 0.95n$ elements,
in contradiction with condition (b). Thus the hypotheses of Lemma 6.4 are satisfied
with $\delta = 0.95$ and $\rho = 0.9$ and, by the inductive hypothesis, Lemma 6.4 gives us
that

$A^{\lfloor n^{c_4 \log n} e^{c(\log n)^3} \cdot C f(0.95n) \rfloor} \supseteq \mathrm{Alt}([n])$,

where $c = c(0.9)$ and $f$ is as in (6.9). We apply Lemma 6.3 with $k = 4$, $h(x) =
x^{c_4 \log x} e^{c(\log x)^3}$, $\delta_0 = 0.95$, $\delta_1 = c_2/c_1$, $\delta_2 \in (0, 4\log(1/\delta_0) - \delta_1)$ arbitrary, and $t$
large enough that $h(x) \le \exp(\delta_2 c_1 (\log x)^3 \log\log x)$ holds for $x \ge t$. We obtain that
$n^{c_4 \log n} e^{c(\log n)^3} \cdot C f(0.95n) \le C f(n)^{1 - (c_2/c_1)/(\log n)}$ for $n$ larger than a constant, and
so we obtain conclusion (6.7).

We end by noting that we can set $c_2 = \lceil c_4/c_3\rceil = 10296$, $c_1 = \lceil c_2/(4\log 1/0.95)\rceil =
50182$, and thus fulfil (6.6). □



We now use Corollary 6.6 to prove both the Main Theorem and Cor. 1.3 (for
Sym(n) and Alt(n)) with explicit constants in the exponents.

Theorem 6.7. Let $G = \mathrm{Sym}(n)$ or $\mathrm{Alt}(n)$. Then

$\mathrm{diam}(G) = O\left(e^{c_1(\log n)^4 \log\log n}\right)$,

$\overrightarrow{\mathrm{diam}}(G) = O\left(e^{(c_1+1)(\log n)^4 \log\log n}\right)$,

where $c_1 > 0$ is the absolute constant in Corollary 6.6.

As we have said, $c_1 = 50182$ is valid.

Proof. Let $A$ be an arbitrary set of generators of $G$. Let $X = A \cup A^{-1} \cup \{e\}$. The
undirected Cayley graph $\Gamma(G, X)$ is just the undirected Cayley graph $\Gamma(G, A)$ with
a loop at every vertex; their diameters are the same. Thus, by Corollary 6.6,

$\mathrm{diam}(\Gamma(G, A)) = \mathrm{diam}(\Gamma(G, X)) \le C e^{c_1(\log n)^4 \log\log n}$.

By [Bab06, Cor. 2.3],

$\mathrm{diam}(\vec{\Gamma}(G, A)) \le O\left(\mathrm{diam}(G)(n \log n)^2\right) \le O\left(e^{(c_1+1)(\log n)^4 \log\log n}\right)$.

□
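
The first step of this proof (adding the identity to the generating set only adds loops and does not change the diameter) can be observed on a very small group. The sketch below is an illustration under assumptions made here: it uses $\mathrm{Sym}(4)$ with a transposition and a 4-cycle as $A$, and the helper names are hypothetical. Since a Cayley graph is vertex-transitive, a single breadth-first search from the identity gives the diameter.

```python
from collections import deque

def inverse(p):
    inv = [0] * len(p)
    for point, image in enumerate(p):
        inv[image] = point
    return tuple(inv)

def cayley_diameter(generators, n):
    """BFS from the identity in the Cayley graph of the group generated by
    `generators` (a symmetric set); returns (diameter, group order)."""
    identity = tuple(range(n))
    dist = {identity: 0}
    queue = deque([identity])
    while queue:
        g = queue.popleft()
        for a in generators:
            ga = tuple(a[g[i]] for i in range(n))  # apply g, then a
            if ga not in dist:
                dist[ga] = dist[g] + 1
                queue.append(ga)
    return max(dist.values()), len(dist)

n = 4
A = [(1, 0, 2, 3), (1, 2, 3, 0)]            # a transposition and a 4-cycle
symmetric = A + [inverse(a) for a in A]     # A together with its inverses
with_loops = symmetric + [tuple(range(n))]  # X = A, inverses, and e

without_e = cayley_diameter(symmetric, n)
with_e = cayley_diameter(with_loops, n)
print(without_e, with_e)
```

Both searches reach all 24 elements of $\mathrm{Sym}(4)$ and report the same diameter, as the argument above requires.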

Finally, we note that, as $C e^{c_1(\log n)^4 \log\log n} \le e^{(c_1 + \log C)(\log n)^4 \log\log n}$, the bounds
obtained in Thm. 6.7 can be written in the form stated in the Main Theorem and
Cor. 1.3.